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SUMMARY  OF  REPORT 


The  report  consists  of  papers  written  in  six  research  areas. 
Most  of  these  papers  have  been  published  in  journals  or  conference 
proceedings,  or  are  soon  to  be  published.  A  few  are  taken  from 
earlier  semi-annual  reports. 

The  six  research  areas  are  Packet  Radio  Networks,  Adaptive 
Routing,  Algorithms,  Network  Design,  Switching  Techniques,  and 
Local  Area  Networks. 

In  the  area  of  Packet  Radio  Networks,  our  results  on  the 
throughput  analysis  of  Multihop  Packet  Radio  Networks  was  first 
presented  at  ICC'80  (A.l).  An  expanded  version  with  extensions  has 
been  accepted  for  publication  in  the  IEEE  Transactions  on  Communica¬ 
tions  (A. 2).  Further  extensions  are  presented  in  papers  A. 3,  A. 4, 
and  A. 5.  In  this  work  we  developed  an  algorithm  for  finding  the 
throughput  of  Multihop  Packet  Radio  Networks  with  arbitrary 
topologies,  sizes,  connectivities,  traffic  requirements,  and  routings. 
Details  of  the  algorithm  are  given  in  paper  A. 6. 

Our  work  on  Adaptive  Routing  was  first  presented  at  NTC'76. 
It  was  further  reported  on  in  the  IEEE  Transaction  on  Communications 
in  an  invited  paper  in  April  1981  (B.l).  Extensions  to  this  work  are 
presented  in  papers  B.2,  B.3,  B.6,  and  B7.  Our  approach  involved 
two  levels  for  adaptive  routing  -  a  slowly  varying  global  level  and  a 
dynamic  local  level.  This  partition  produced  a  stable  routing  that 
was  amenable  to  analysis.  We  showed  that  significant  performance 
improvements  were  obtainable.  A  control  theoretic  approach  applied 
to  traffic  networks  is  presented  in  papers  B.4  and  B.5. 


Our  work  in  the  area  of  algorithms  has  focused  on  finding  ap¬ 
proaches  to  solving  intractable  problems  which  arise  in  the  design  and 
analysis  of  networks.  Several  of  these  papers  (C.l,  C.2,  and  C.4) 
deal  with  the  solution  of  specific  fundamental  problems.  Others  (C.5, 
C.6)  deal  with  the  application  of  probabilistic  methods  to  the  solution 
of  a  broad  class  of  problems.  Finally,  in  C.3,  we  present  a  method 
of  directly  trading  computational  complexity  for  solution  accuracy  in 
the  solution  of  combinatorial  optimization  problems. 

Our  work  in  the  area  of  Network  Design  has  led  to  solutions  to 
specific  problems  in  several  areas  including  centralized  networks 
(D.l,  D.2),  and  circuit  switched  networks  (D.4,  D.5,  and  D.6).  It 
has  also  resulted  in  the  development  of  a  facility  location  algorithm 
(D.3)  which  has  been  applied  to  the  location  of  earth  stations  and 
packet  radio  repeaters  in  broadcast  networks. 

In  section  E,  we  present  our  work  on  an  Information  Theoretic 
approach  to  communications  in  networks  (E.l  and  E.2),  on  a  compari¬ 
son  of  packet  switching  and  character  switching,  including  an  optimum 
packetization  strategy  (E.3),  on  optimal  multiplexing  in  Integrated 
Switching  schemes  (E.4)  and  on  Integrated  Satellite  Access  (E.5). 

In  papers  F.l  and  F.2,  we  present  some  work  on  Local  Area 
Networks  for  which  we  evaluate  performance  and  assess  the  impart  of 
user  interface  architecture. 

Most  of  the  work  reported  upon  here  was  supported  by  the  US 
ARMY  (CECOM).  Portions  of  this  work  were  also  supported  by  the 
National  Science  Foundation,  by  Network  Analysis  Corporation,  by 
General  Telephone  and  Electronics,  and  by  IBM's  Federal  Systems 
Division.  Research  not  supported  under  this  grant  is  indicated  by  an 


asterisk  in  the  table  of  contents  and  is  included  to  complement  the 
other  research  reports  and  provide  a  more  complete  record  of  our 
activities. 
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THROUGHPUT  ASALYSIS  OF  MULTIHOP  PACKET  RADIO  NETWORKS* 


Robert  R.  Boorstyn  and 
Aaron  Kershenbaum 

Polytechnic  Institute  of 
New  York 
333  Jay  Street. 

Brooklyn,  New  York  11201 


Veil  Sahin 

Bell  Laboratories 
Holmdel,  New  Jersey  07733 


ABSTRACT 

We  consider  the  problem  of  obtaining  exact  expressions  for  throughput  and  blocking 
probabilities  in  multihop  packet  ratio  networks  operating  under  CSMA.  We  obtain  exact  results 
for  a  general  class  of  message  lengths,  for  general  topologies,  and  for  perfect  capture.  These 
results  are  obtained  by  assuming  perfect  acknowledgments. 


I.  INTRODUCTION 

We  consider  the  problem  of  obtaining  exact  expressions  for  throughput  and  blocking 
probabilities  in  multibop  packet  radio  networks  operating  under  carrier  sense  multiple  access 
(CSMA).  Procedures  are  developed  which  can  be  used  to  analyze  general  topologies  for  a 
general  class  of  packet  length  distribution.  Examples  of  chains,  rings,  and  stars  are  presented. 

II.  THE  NETWORK  MODEL 

We  consider  the  problem  of  analyzing  the  throughput  capability  of  a  multihop  packet 
radio  network  operating  under  carrier  sense  multiple  access  (CSMA).  Thus,  we  assume  that 
the  network  is  comprised  of  terminals  equipped  with  radio  transponders  suitable  for 
broadcasting  data  over  a  limited  distance.  In  general,  the  source  and  destination  terminals 

•  This  research  was  partially  supported  by  USARMY  CENCOMS  under  contract  DAAK  80-80-K-0579,  and  by  the 
National  Science  Foundation  under  grant  ENG-79-08210. 


cannot  hear  each  other  directly,  and  the  data  has  to  be  relayed  by  one  or  more  intermediate 
devices.  A  separate  set  of  devices,  called  repeaters,  may  exist  for  this  purpose,  or  the  terminals 
themselves  may  relay  messages  for  one  another. 

Control  of  the  network  is  completely  distributed,  i.e.,  no  station  or  central  control 
mechanism  is  assumed  to  exist.  Rather,  we  assume  that  each  source  terminal  has  prestored 
one  or  more  routes  to  all  destinations  and  includes  all  necessary  routing  information  in  the 
packets  if  transmits.  These  assumptions  are  made  merely  to  simplify  the  presentation.  In  fact, 
the  results  presented  are  valid  for  networks  using  alternate  routing  as  long  as  routing  changes 
are  not  made  over  short  time  intervals.  One  of  the  motivations  for  this  study  came  from  a 
consideration  of  the  design  of  routing  procedures  for  such  networks.  It  was  necessary, 
however,  to  first  develop  an  understanding  for  the  throughput  of  various  topologies. 

Exogenous  traffic  is  modeled  as  independent  Poisson  processes  arriving  at  each 
source  node,  with  appropriate  rates  and  packet  lengths.  The  topology  is  specified  by  a  listing  of 
which  terminals  (or  repeaters)  can  hear  each  other.  In  the  remainder  we  will  not  distinguish 
between  terminals  and  repeaters  and  will  refer  to  them  collectively  as  either  terminals  or  nodes. 
In  general  the  transmissions  of  one  terminal  can  be  heard  by  many  other  terminals.  The 
routing  will  specify  which  terminal  is  to  repeat  the  packet,  if  necessary. 

If  two  or  more  transmissions  are  simultaneously  heard  by  a  terminal  (called  a 
'collision')  at  least  one,  and  possibly  both,  is  'lost'  and  must  be  retransmitted.  We  assume 
retransmissions  are  scheduled  at  a  random  instant  in  time  sufficiently  far  in  the  future  so  as  to 
preserve  the  Poisson  nature  of  the  combined  traffic  stream,  which  now  consists  of  exogenous 
traffic  and  rescheduled  traffic.  For  this  study,  we  assume  that  a  packet  can  be  retransmitted  as 
many  times  as  is  necessary,  i.e.,  that  there  is  no  maximum  allowable  number  of 
retransmissions. 

At  any  time,  terminals  may  either  transmit  or  receive,  they  cannot  do  both 
simultaneously.  Before  transmitting,  a  terminal  senses  the  channel.  If  it  detects  that  any  of  its 


neighbors  (i.e.,  terminals  that  it  can  hear)  are  transmitting  (by,  e.g.,  sensing  a  carrier)  it 
reschedules  the  transmission  as  for  collided  packets  above.  If  at  the  scheduled  time  for  a 
transmission,  the  terminal  is  already  engaged  in  transmitting  a  packet,  the  new  packet  is  also 
rescheduled  as  above.  Thus  packets  are  continually  rescheduled  until  they  are  successfully 
delivered  to  the  next  terminal  on  their  route.  We  assume  that  the  total  stream  of  traffic 
scheduled  by  any  terminal  is  a  Poisson  process.  This  includes  originating  traffic  and  packets 
rescheduled  either  due  to  collisions  or  due  to  the  channel  having  been  sensed  busy.  This 
scheme  is  called  carrier  sense  multiple  access  (CSMA).  The  Poisson  assumption  is  valid  for  the 
assumptions  made  above  and  will  yield  accurate  results  for  throughput.  Compromises  will  have 
to  be  made,  however,  if  an  accurate  picture  of  time  delays  is  to  be  considered. 

It  is  possible,  due  to  non-zero  propagation  delay,  that  collisions  of  transmissions 
from  neighboring  terminals  may  still  take  place  despite  the  CSMA  strategy.  This  will  occur  if  a 
terminal  senses  the  channel  before  another  terminal’s  transmission  is  received.  This  effect  is 
small  if  terminals  are  reasonably  close  or  are  not  transmitting  at  high  speed.  We  will  ignore  this 
phenomena  here,  and  assume  that  all  transmissions  are  instantly  heard  by  their  neighbors. 

A  passive  acknowledgment  is  used  for  transmission  to  neighbors.  The  transmitting 
terminal  listens  to  the  channel  to  hear  if  a  packet  is  being  rebroadcast  by  a  neighbor.  If  after  a 
prespecified  time  interval,  the  transmitting  node  does  not  hear  the  packet  rebroadcast,  it 
retransmits  the  packet.  But  the  packet  may  have  been  successfully  received  by  the  neighbor 
even  though  the  originator  does  not  hear  the  rebroadcast.  Duplicate  packets  may  be 
transmitted  and  deleted  only  at  the  final  destination  or  they  may  be  detected  and  deleted  earlier. 
An  end-to-end  acknowledgment  is  returned  to  the  originator  from  the  final  destination.  In  this 
paper  we  assume  that  passive  acknowledgments  are  always  heard  and  ignore  the  effect  of  end- 
to-end  acknowledgments.  Alternately,  these  acknowledgements  could  have  been  added  to  the 
required  traffic.  (In  a  sequel  paper,  the  effect  of  passive  acknowledgements  will  be  studied). 


We  depict  the  topology  of  the  network  by  a  graph  where  terminals  are  represented 
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by  nodes.  The  nodes  are  connected  by  a  link  if  they  can  hear  each  other's  transmissions,  i.e.,  if 
they  are  neighbors.  As  an  example  see  Figure  1.  Node  A  can  hear  node  B,  but  not  node  C. 
Node  B  can  hear  both  nodes  A  and  C.  Node  C  can  hear  node  B,  but  not  node  A.  If  node  A  is 
transmitting  to  node  B  and  node  C  begins  transmitting,  then  the  transmission  from  A  to  B  may 
be  lost  depending  upon  the  'capture*  assumptions  we  make.  A  conservative  assumption  is  that 
the  A  to  B  transmission  is  lost  •  this  is  known  as  zero  capture.  Alternatively,  perfect  capture 
assumes  that  this  transmission  is  successfully  received.  Half-amplitude  capture  assumes  that 
the  transmission  is  lost  if  C  dominates  A  at  B.  This  can  happen  if  C  is  closer  to  B  than  A  is  to 
B,  or  has  a  greater  signal  strength  perceived  at  B  than  A  has.  If  A  dominates  C,  then  the 
transmission  is  successful.  However,  in  all  cases  of  a  collision  we  assume  the  later  transmission 
is  lost.  Thus  if  C  is  transmitting  to  B,  this  packet  is  lost  in  all  cases.  We  will  consider  only 
perfect  capture  situations  below.  Note  that  under  CSMA  if  node  B  is  transmitting,  neither  A  or 
C  is  allowed  to  transmit. 

We  assume  that  a  routing  has  been  specified.  This  takes  the  form  of  deciding  which 
of  the  neighbors  are  to  rebroadcast  a  packet  from  a  particular  source  to  a  particular  destination. 
Thus  the  amount  of  traffic  that  a  terminal  wishes  to  send  to  its  neighbor  can  be  computed.  If 
these  rebroadcast  packets  are  scheduled  at  a  random  time  far  in  the  future  the  Poisson 
assumption  for  traffic  streams  is  preserved.  We  assume  that  the  traffic  between  neighbors  is 
specified  and  form  independent  Poisson  processes.  We  assume  that  the  packet  length  is 
reassigned  independently  at  each  hop.  This  is  analogous  to  the  'independence  assumption’  in 
queuing  networks. 

The  details  of  CSMA  for  a  single  bop  network  can  be  found  in  the  papers  by  Tobagi 

and  Kleinrock^^V  Tobagi  has  also  developed  some  simple  models  for  two-hop  networks^^V 

Details  of  a  packet  radio  network  can  be  found  in  a  paper  by  Kahn^^V  A  discussion  of  routing 

(4) 

in  multihop  packet  radio  can  be  found  in  the  paper  by  Gitman,  Van  Slyke,  and  Frank'^  ’ .  An 
earlier  version  of  this  paper  was  presented  at  1CC'80(S).  More  details  can  be  found  in  the 
thesis  of  Sahin(6). 
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III.  GENERAL  RESULTS 

In  this  section  we  develop  some  expressions  that  are  valid  for  the  packet  radio 
network  we  have  modeled  above  using  CSMA  and  with  an  arbitrary  packet  length  distribution 
Let  i  be  a  node,  n,  one  of  its  neighbors,  N*  the  set  of  all  the  neighbors  of  i,  and  Ni  the  set  of 
all  i’s  neighbors,  including  i.  Let  gi  be  the  total  rate  (in  packets/sec)  of  all  scheduled  traffic  at 
node  i.  This  includes  originating  traffic  and  all  rescheduled  traffic  and  is  assumed  to  be  Poisson. 
Let  I/m,  be  the  average  length  of  packets  transmitted  by  node  i.  Let  G,  =  i,/M,  be  a 
normalized  rate. 

Node  i  is  either  busy  (transmitting)  or  idle.  It  will  transmit  a  scheduled  packet  if  at 
the  instant  it  is  scheduled  all  nodes  in  TV,  are  idle.  Let  A  be  a  set  of  nodes.  Let  P(A)  be  the 
probability  that  at  a  random  instant  all  nodes  in  A  are  idle.  The  nodes  not  in  A  may  or  may 
not  be  idle.  Similarly  P(i),  P(i,A),  P(A,B)  are  the  probabilities  that  i  is  idle,  node  i  and  nodes 
in  A  are  idle,  and  all  nodes  in  A  and  B  are  idle. 

Since  traffic  is  scheduled  at  node  i  with  a  Poisson  rate  g,  ,  will  be  transmitted  only  if 
TV,  is  idle,  and  transmissions  have  average  length  1/m,,  the  probability  that  i  is  busy  is  given  by 

l-P(i) -C,P(TV,)  (1) 

If  i  is  busy  then  under  CSMA,  r,  must  be  idle.  Then  since 

/’('!()  -  P(n,,i)  +  P(n,\i  iMsyKl-PI/)] 
and  P(r,Ii  busy)  ”  1,  we  have 

P{n,.i)  ~  Pin,)  +  P{i)  -  \  C) 


Similarly,  if  A  C  TV,*, 
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P[A  .!  )  -  P(A  )  -t-  P(i)  -  1 
letting  A  =  in  equation  t3),  and  using 


(31 


P(.\")  =  P(.\’,)/Pii\N‘) 


we  get 


(4) 

Equation  (4)  is  often  found  in  CSMA  literature. 

A  packet  from  i  to  /i,  will  be  transmitted  when  it  is  scheduled  if  jV,  is  idle.  During 
the  transmission  all  nodes  in  A,'  will  be  idle.  It  will  be  successfully  received  at  «,  if  all 
neighbors  of  n,  not  in  A,  are  also  idle  at  the  beginning  of  transmission.  Otherwise  a  collision 
will  occur.  Let  jj  be  the  rate  in  packets/sec,  determined  by  the  routing  and  assumed  Poisson, 
of  the  traffic  that  i  wishes  to  send  to  n,.  This  is  the  required  throughput  or  offered  traffic.  Let 
g, ,,  be  the  rate  of  all  scheduled  traffic  from  i  to  n,.  We  have  also  assumed  that  all  these 
streams  are  Poisson  and  independent.  Of  these  g,  „,  packets  per  second,  s,  „  must  be 
successful.  Thus 


g.. 


/’(A,  .A,,) 


(5) 


where  S,_„^  =  and  G,  „  =  g,  „/m, 

The  total  scheduled  traffic  ( norn.ali/ed)  at  a  node  is  given  by 


(6) 


From  equations  '1)  through  (6l  wc  *i.:i  oi  derive  a  relation  between  the  5,  and  G,,  and 


>0 
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determine  the  maximum  S',  the  network  can  support.  This  we  call  the  (maximum) 
throughput  or  capacity.  In  the  next  section  we  develop  this  relationship  for  exponential  packet 
lengths  and  arbitrary  topologies.  Later  we  extend  the  analysis  to  a  general  class  of  packet 
length  distributions. 

IV.  EXPONENTIALLY  DISTRIBUTED  PACKET  LENGTHS 

If  the  packet  lengths  are  exponentially  distributed,  then  the  system  can  be  viewed  as 
a  Markov  process  where  the  states  are  identified  by  which  nodes  are  idle  and  which  are  busy. 
Let  D  be  a  set  of  busy  nodes.  Because  of  CSMA,  no  nodes  in  D  may  be  neighbors  of  each 
other.  Let  Q(D)  be  the  probability  that  at  an  instant  of  time,  all  nodes  in  D  are  busy,  and  all 
nodes  not  in  D,  are  idle.  Then  each  set,  D,  represents  a  state  in  a  Markov  system,  and  Q(D)  is 
the  state  probability.  In  particular,  the  null  set  D  *  represents  the  state  that  all  nodes  are 
idle. 

Assume  the  system  is  in  state  D.  It  will  leave  the  state  if  any  iiD  stops 
transmitting.  This  happens  with  rate  Thus  the  transition  to  state  {D-i}  occu’-s  with  rate  fi/. 
The  only  other  way  to  leave  state  D  is  for  one  of  the  idle  nodes  that  is  not  a  neighbor  of  any 
i(D  to  begin  to  transmit.  This  occurs  with  rate  gj.  Let  Np  be  the  set  of  all  neighbors  of  all 
nodes  in  D.  Then  the  transition  from  D  to  {D+j},  JtNp,  occur'  with  rate  gj.  The  global 
balance  equations  for  this  system  are 

(Sm,  +  2  2  M;G(D+y)  (7) 

‘•D  j.  0  Np  ^  <^0 

where  D  is  one  of  the  special  sets  defined  above. 

It  is  easy  to  see  that  the:>c  equations  are  satisfied  by 

Q(D)~  —  Q(D-i)  -  G,Q(D-i),  itD  (8) 

fti 


V# 


G(Z))  -(n  c,)  Q(4>) 

!«/) 


where  we  adopt  the  convention  that  n  G,  *  I.  Summing  over  all  D*  we  get 

<<♦ 


S  Q(D)^  2  In  C(«)]-1 


In  the  previous  section  we  found  we  were  interested  in  quantities  like  P(A),  where 
A  is  any  set  of  nodes,  and  P(A)  is  the  probability  that  all  nodes  in  A  are  idle,  and  all  nodes  not 
in  A  may  or  may  not  be  idle.  This  can  be  found  by  summing  Q(D)  over  all  sets  D  that  do  not 
contain  nodes  in  A.  Thus 


2  (nc*) 

DCA‘ 

2  (DC,) 

DCN 


where  DCA^  refers  to  all  such  sets  contained  in  the  complement  of  A  and  N  is  the  set  of  all 
nodes.  We  adopt  the  shorthand  notation. 


SP{B)  -  2  (  n  G.) 


where  SP  refers  to  sum  of  products.  Thus 


P{A)  ~SP{A^)/SP(N) 


Equations  (5),  (6).  and  (11)  can  be  used  for  any  topology  to  generate  the  solution 
to  our  problem.  The  equations  relating  the  5,  , .  and  can  be  solved  iteratively.  For 


example,  equation  (5)  now  becomes 
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S,.,,  S/>{(N,+N,)M 

'g~  ~SP(N) 


where  by  A  +  B  we  mean  the  union  of  A  and  B. 

Evaluation  of  sums  of  products  in  equation  (12)  are  made  easier  by  the  following 
two  rules.  Consider  two  sets  of  nodes  A  and  B  such  that  no  node  in  A  can  hear  any  node  of  B. 
Then 


SP(A+B)  ~SP(A)  SP(B),  ADB  ^</> 


(15) 


Also, 


SP(A)  -SPiA-i)  +  CiSP{A-Nt),  UA  (16) 

To  prove  these  rules  just  consider  all  products.  We  have  successfully  evaluated  many  complex 
topologies  with  these  procedures. 

There  are  other  relations  which  will  be  found  useful  in  extending  our  model  to  more 
complex  situations.  We  prove  some  of  these  below.  Let  C  be  a  cut,  i.e.,  a  set  of  nodes  that 
divides  the  network  into  three  parts  A,  B,  and  C,  where  A  and  B  have  no  neighbors  in  common 
as  in  equation  ( IS).  Let  A^A  i  +  A2,  B  B i -h  B2  where  A  1A2  ^  B 1B2  “  (P  Then 


P(A„C,B,)  SP(A2-I-B2)  SP(A2) 

P(C,B,)  ~  SP(A+B2)  “  SP(A) 


But 


PM  ic.  SP(A2+B)  SP(A2) 

^  P(C)  SP(A-I-B)  ~  SP(A) 


Thus 


V.  A  GENERAL  CLASS  OF  PACKET  LENGTH  DISTRIBUTIONS 

In  this  section  we  extend  the  results  just  proven  to  include  a  general  class  of 
distributions  for  the  packet  length.  We  will  show  that  the  procedures  developed  for  perfect 
capture  are  independent  of  packet  length  distribution.  To  prove  this  we  start  with  a  simpler 
extension.  In  the  above  we  assumed  that  all  packet  lengths  are  exponentially  distributed  and 
have  the  same  mean  when  transmitted  by  a  node  to  any  of  its  neighbors.  Different  nodes  may 
transmit  different  average  length  packets,  however.  Now  assume  that  while  all  packet  lengths 
are  still  exponentially  distributed,  the  average  length  packet  transmitted  from  a  node  may  be 
different  to  each  of  its  neighbors.  This  will  be  useful  in  analyzing  different  protocols  (to  be 
presented  in  a  sequel  paper)  but  is  presented  here  as  the  first  step  in  the  desired  extension. 

Now  the  state  of  the  network  depends  upon  who  is  transmitting  and  to  whom.  We 
can  keep  the  same  structure  by  breaking  every  node  into  a  set  of  'micronodes',  one  for  each 
neighbor.  These  nodes  may  be  indexed  by  (/,n,).  If  i  is  transmitting  to  «,  then  this  node  is 
active,  otherwise  it  is  idle.  Micronodes  are  connected  in  our  topology  if  they  can  hear  each 
other.  Since  CSMA  still  prevails,  all  micronodes  for  a  given  node  are  fully  connected 
Furthermore  ail  micronodes  for  nodes  that  are  connected  in  the  original  topology  are  also  fully 
connected.  The  analysis  now  proceeds  as  above  since  the  Markovian  property  has  been 
maintained.  For  example,  equation  (14)  still  holds,  but  now  A,  and  are  collections  of 


9 


«» 


•vj 
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micronodcs.  Note  that  N,  contains  the  full  set  of  micronodes  for  i  and  all  neighbors  of  i  5,  , 
and  G,  ,,  have  the  same  meaning  as  before.  However  they  are  normalized  by  the 

average  packet  length  for  packets  going  from  i  to  n, .  The  terms  in  the  sums  of  products  are 
Gt  „^  for  the  micronodes. 

Let  A  contain  full  sets  of  micronodcs  and  include  the  node  i.  Then  from  equation 

(16) 


+  G,  „  SP(A  -N, )  (20) 

Here  we  have  used  the  fact  that  and  the  notation  that  N/  is  the  set  of  all 

micronodes  of  i  and  neighbors  of  i.  Since  this  is  equivalent  to  the  original  set  of  node 
neighbors  of  i  we  keep  the  same  notation.  Repeating  for  all  neighbors  of  i  we  get, 


If  we  let 


S/>(/l)  -5/»(/l-i)  +  (2  Gt,,^)SPiA-N,) 


(21) 


Ml  It  '  «  '  ' 


then  equation  (16)  is  preserved.  In  a  similar  manner  all  previously  derived  equations  can  be 
maintained  where  G,  takes  on  the  definition  in  equation  (22).  Here  I/mi  is  an  average  packet 
length,  averaged  over  the  different  average  packet  lengths  to  different  neighbors  in  proportion 
to  their  scheduled  rate. 

We  now  prove  our  main  theorem  for  this  section.  Assume  that  the  length  of 
packets  transmitted  from  i  to  rii  has  the  density 


~  J*- 


(23) 
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where 


2 

j 


a.. 


2:  0  and  2  “  1 


Thus  the  length  is  distributed  as  a  positive  sum  of  exponentials.  Another  way  of  looking  at  this 
is  that  the  a’s  are  the  probabilities  of  choosing  the  associated  exponential  density.  Now  create  ^ 

micronodes  for  each  triple  Here  we  use 

^i^,j  The  micronodes  for  some  i  and  any  are 

fully  connected  as  are  the  micronodes  for  neighboring  nodes.  Equations  (20)  and  (21)  now 
become 


ind 


S/*(/l) 

S/«(/l) 


-S/>U  -  (i.n„j)]  +  Gt  „^jSP{A-Si) 


~SP{A-i)  + 


2 

HfJ 


(24) 

(25) 


In  the  same  manner  as  above,  we  let 


G, 


II 


2 


Si  .i>i  ^ 


-iJ 


2 


2 


s,.. 


2 


These  equations  are  used  to  find  P(A)  in  terms  of  C,  and  are  identical  in  form  to  those  derived 
for  exponentially  distributed  packet  lengths. 


Equation  (5)  in  turn  comes  from 
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Summing  over  j,  we  get  equation  (5) 


Si..,  -  S  -  c„^/*(^,+a;,) 


Thus  all  relations  between  the  G's  and  the  S’s  are  preserved.  The  actual  nature  of  the  problem 
is  taken  into  account  by  the  relationship  between  the  S's  and  the  normalization  by  the  n's. 

We  can  now  restate  the  above  theorem.  Let  i  be  any  node  and  j  tN‘ ,  the 
neighborhood  of  i.  Let  s,j  be  the  successful  (desired)  rate  from  i  to  j,  in  packets/sec.  Let  the 
density  of  the  packet  lengths  be 


ftjix)  -  2  Oij.kH'ij.ktxp  “'''•*  fwhere  m.j,*  >  0,  0, 

k 

and  2  fl/j.*  *  1- 


Let  1/m/j  =  2  ^ij.k^>*ijjc  average  packet  length,  in  seconds.  Let  Sjj  =  Sij/njj.  Then 

k 


“  P{NiUNj)  =  a  function  of  ■  ■  ■  ) 


where 


-  2  C..; 


Proof: 


let  ^,j.k  o,.].kS,. 


Then  -  P{N, 


where 


G, 


2  G,j  =22 
JtS'  J,s'  * 


.* 


Dividing  5,^  *  by  Mi  j.*  and  summing  over  k,  we  get 


VI.  EXAMPLES  OF  THE  PROCEDURE 

As  an  example  consider  the  chain  of  four  nodes  shown  in  Figure  2.  We  assume 
5i2=“S2i  =  S23“-S32”S34  =  ‘S^43”‘S^  for  simplicity,  and  perfect  capture.  Also  note  that 
C  12  “  Cl.  (f43  —  <#4  and  by  symmetry  <7i  ■■  C4,  C21  =  C34,  and  Gu^Gn.  Also  from 
equation  (6),  G^  —  <?2i  +  23  “  Gy  From  equation  (5),  we  have 

-  P(1.2.3)  -  7?-  =  P(1.2.3,4) 

O,  O21  &23 

But 

sp{N)  -  2  (nc.)  -  1  -I-  c,  +  C2  +  c;3  +  (74  +  c,C3  +  Gfi^  +  g,g^ 

ItD 

-  1  +  2(7,  +  2C2  +  2G1G2  +  (7,^  *  A 


and 


P(l, 2.3,4)  -  l/A,  P(l,2,3)  »  (!+C4)/A  -  (1+C,)/A 


Solving,  we  get 


C2  -  C,(2+C,)  andS  -=  C,(l+Ci)/A  , 


or 


S  -  (7,(1+C,)/(1+6C,+7G,^+2C,M. 


Wc  can  now  find  the  maximum  value  of  S  possible,  the  throughput  of  the  chain,  which  is  .128. 
obtained  when  C  i  =  0.71 . 

In  general  the  equations  cannot  be  solved  as  simply  as  for  this  four  node  example. 
Equation  (11)  is  used  to  get  expressions  for  P(A)  in  terms  of  G,  .  Then  equation  (14)  is  used 
to  get  expressions  for  S,  in  terms  of  C,  and  P(A).  Equation  (6)  provides  the  relation 
between  G,  and  G,,,,^.  The  5,,,^  are  found  from  the  offered  traffic,  the  routing,  and  other 
assumptions  and  are  considered  as  inputs.  The  equations  are  iterated  until  a  solution  of  G's  for 
a  set  of  S's  is  found.  The  maximum  set  of  S's  possible  is  considered  the  throughput,  or 
capacity,  of  the  network.  For  some  modest  size  problems,  as  above,  the  equations  can  be 
solved  directly. 

As  a  second  example  consider  the  star  topology  shown  in  Figure  S.  Here  assume 
there  are  L  legs  of  N  2  nodes  each.  Denote  the  center  node  by  0,  the  nodes  one  hop  out  by 
1,  and  the  other  nodes  by  2.  Further  assume  symmetrical  traffic  in  the  nodes  and 
•Soi  *  •S'lo  “  -S’!:  ”  *521  “  S.  Then  LS  is  the  total  traffic  successfully  transmitted  by  node  0.  The 
equations  are 


-  P{all  but  L-\  §1  nodes ) 

Co  Gio  Cjo 


5P(£-1#  2  nodes) 
5P(A^) 


-  (1+C2)^-7a 


-|--=P(0.1,2) 


SPiL-\  legs) 
SP{N) 


(1+C,+C2)^"'/A 


where 


A-SP(yV)  -d+Gi+Cj)^  +Go(1+G2)^ 

C|  -  Cio  +  Gi2 

But  C 10  “  Co/L  and  C  ,2  “  C2.  so  G  |  -  C2  +  Gq/L.  Thus  wc  have  two  equations: 


« 


LS/Co-  (1+C2)^''M  and  S/G2  “  {\-^2G 2+go/L)‘--'/A 


For  any  5  <  So,,,  they  can  be  solved  for  Go  and  Cj.  Alternatively  for  any  Go  we  can  find  the 
corresponding  Gj  by  solving 

Go(1+<'2)‘"'  “  G2(l+2G2+G,y't)^"'  . 

Then  the  relation  between  S  and  G2  can  be  studied.  We  then  find  the  maximum  S, 
possible.  LSom.  the  maximum  throughput  of  the  star. 

For  larger  problems  we  will  get  several  equations  of  the  form 

Co-LSMH-G2)^“' 

Gj  -  SA/(1+2G2+Go/Z.)'-"' 

For  any  S  we  solve  these  iteratively.  Since  Go  S  LS,  G\  ^  25,  and  G2  S  5,  the  lower 
bounds  are  good  starting  points  for  G,.  For  S  sufficiently  less  than  Sbo  we  have  found  the 
iteration  converges  monotonically  and  rapidly.  As  S  approaches  Smax  from  below  the 
convergence  is  still  monotonic  but  slows  appreciably.  For  5  >  Squx  the  iteration  does  not 
converge  and  often  diverges  dramatically.  We  have  uncovered  no  serious  numerical  problems 
with  this  procedure  in  the  many  examples  we  have  evaluated. 

VII.  NUMERICAL  EXAMPLES 

We  consider  here  three  different  topological  structures  with  exponentially  distributed 
packet  lengths  and  perfect  capture.  We  assume  all  ■■5,  for  all  i,  and  take  full  advantage  of 
symmetry.  The  three  topologies,  shown  in  Figures  3,  4,  and  5,  are  a  chain,  a  ring,  and  a  sur, 
all  with  various  lengths.  In  each  case  we  find  the  maximum  throughput,  S.  These  are  given  in 
Table  I. 

The  maximum  one  way  throughput  for  a  long  chain  is  S  .086.  This  throughput  is 
approached  when  the  lengths  of  chains  exceed  10.  For  smaller  length  chains  the  throughput  is 
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higher.  In  CSMA  transmissions  of  neighbors  may  not  overlap  in  time.  Since  each  nodes 
transmits  successfully  2S  packets  per  average  packet  transmission  time,  then  we  must  have  S  :< 
1/5.  The  throughput  is  slightly  s<net||Ar  than  half  of  this  limit.  The  cause  of  the  reduction  is 
collisions  from  transmissions  two  hops  away,  the  so-called  ’hidden  terminals’.  The  throughput 
of  S  *  .086  for  a  chain,  although  the  maximum  possible,  is  not  a  useful  operating  point.  As  in 
ALOHA,  this  is  the  point  at  which  delays  become  infinite  and  the  system  is  unstable.  The 
network  would  have  to  be  operated  at  some  lower  level. 

It  is  instructive  to  compare  the  performance  of  multihop  CSMA  with  that  of  slotted 
ALOHA.  Let  p  be  the  probability  of  transmission  in  one  direction  at  a  node.  Then  S  p(l- 
2p)  for  a  long  chain.  here  is  .074  which  is  approximately  14%  less  than  that  for  CSMA. 
There  are  two  factors  working  here.  CSMA  will  produce  less  collisions  since  neighbors  will  not 
interfere  with  each  other.  Hidden  terminals  will  still  produce  collisions.  (All  terminals  arc 
hidden  in  ALOHA).  But  CSMA  prohibits  possible  successful  transmissions.  For  instance, 
node  3  can  transmit  successfully  to  node  2  while  node  4  is  transmitting  successfully  to  node  5. 
This  is  possible  in  ALOHA  but  prohibited  in  CSMA.  This  is  one  of  the  prices  paid  to  control 
collisions. 

We  note  that  for  a  ring  greater  than  7  nodes  the  maximum  throughput  is  the  same 
as  that  for  a  long  chain.  This  is  expected  since  the  co-ngestion  is  now  in  the  middle  and  it  is 
unimportant  whether  or  not  the  chain  is  closed.  A  star  with  two  legs  is  just  a  chain  with  N  for 
the  star  replaced  by  2N+ 1  for  the  chain. 

Consider  the  star  configuration  as  representing  the  center  node  (0)  trying  to  transmit 
to  some  node  or  nodes  far  away  via  many  repeaters.  For  one  leg,  the  maximum  rate  is  .086. 
For  two  legs,  the  maximum  rate  is  2S  or  .172,  exactly  twice.  The  results  are  shown  in  Table  11 
where  the  throughput  of  the  center  node  is  given  by  LS.  We  see  from  Table  II  that  whereas 
the  throughput  doubles  for  L  2,  it  increases  only  by  20%  when  L*3,  by  4-5%  further  when 
L*>4,  and  by  2%  when  L~5.  Congestion  at  the  central  node  is  limiting  its  ability  to  increase  its 


throughput.  Additional  legs,  beyond  three,  are  not  really  helpful. 


We  have  investigated  ways  of  reducing  the  congestion  at  the  center  node.  For  larger 
L,  the  traffic  in  each  leg  is  limited  by  congestion  at  the  center.  The  collisions  that  cause  most 

problems  are  for  transmissions  from  the  first  level  of  nodes  (I,  N+1,  2N-1-1 .  (L-1)N+1)  to 

the  center.  These  are  collided  with  by  other  first  level  nodes.  To  reduce  these  collisions  we 
considered  connecting  the  first  level  nodes  in  a  ring  and  then  fully  connecting  them.  These 
results  are  also  summarized  in  Table  I.  When  the  first  level  is  unconnected  the  throughput 
saturates  at  .229.  When  the  first  level  is  fully  connected  the  throughput  with  9  legs  is  .252,  a 
15%  increase  For  four  legs,  the  ring  connected  topology  is  best,  providing  some  compromise 
between  reducing  collisions  and  allowing  simultaneous  transmissions. 

The  best  that  we  can  expect  in  the  fully  connected  case  is  LS  <  1/3.  This  is 
because  all  transmissions  from  the  center  node  and  all  first  level  nodes  cannot  overlap.  We  will 
discuss  asymptotic  results  with  even  larger  stars  and  chains  in  the  next  section. 

VIII,  ASYMPTOTIC  RESULTS 

We  are  interested  in  asymptotic  results  for  several  reasons.  They  provide  us  with 
the  limiting  behavior  of  the  finite  networks  previously  studied.  Since  the  behavior  of  these 
networks  seems  to  converge  rapidly  with  their  size,  if  asymptotic  results  are  easier  to  obtain, 
they  would  be  useful.  We  are  also  interested  in  very  large  networks.  A  final  reason  is  to  verify 
some  of  the  bounding  arguments  on  throughput  made  in  the  last  section. 

We  first  consider  an  infinitely  long  chain.  We  let  S,,,+i  “  ”  5.  Then  all  nodes 

are  identical.  Also  C/,,+i  “  C/j-i  ”  G,72.  Thus  with  C,  ”  C, 


2S 

G, 


2S 

G 


P(i-i,  I,  1  +  1,  t+2) 


5P(— oo i—2)SP(i+3 oo) 

SP{— CO oo) 


(29) 


We  can  write  the  denominator  as 


S/»(-oo.....oo)«5/»(-oo,  •  ■  . <x>)+G.SP(-<x> . /-2)S/>(/ +2.. 


Now  let  Qji 


SP(i—k  ,...,co) 
SPU . oo) 


SP  {—CO . / ) 


Wc  observe  that  >  1 


converges  is  independent  of  i.  Then  equation  (29)  becomes 
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G  “  QiQi+GQ, 

But 

.  . . -  e.-,  +  oe,-, 

SP{—oo . j)  ^ 

andQo”*!-  Thus  ”  Gi  Therefore 


2S/G 


1 

Gi(Gi  +  2C) 


or 


5 


G 

2QiiQi+2G) 


We  note  that  since  Q-i 


from  equation  (31)  we  have 


G  -Gi(Gi-i) 


Finally  we  have 


S 


g-1 

2Gi(2Q  -1) 


.Gi  2:  1 


■  •.oo). 

for  k  >0  and  if  it 

(30) 

(31) 


(32) 

(33) 

(34) 


The  maximum  value  of  S  is  .086,  reached  when  0|  “  1.7  or  (7  *  1.2. 
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IX.  EXTENSIONS  AND  CONCLUSIONS 

We  have  presented  a  simple  but  fairly  realistic  model  of  a  muitihop  packet  radio 
network  and  have  obtained  maximum  throughputs  for  general  topologies  and  packet  lengths. 
We  have  assumed  perfect  reception  of  acknowledgments  and  have  not  included  additional  traffic 
due  to  end-to-end  acknowledgments.  Some  aspects  of  acknowledgments  can  be  included  by 
increasing  the  required  traffic.  We  are  investigating  the  effect  of  imperfect  acknowledgments 
and  different  retransmission  strategies.  The  model  should  still  be  useful  under  these 
extensions. 
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Tabu  I 


Maximum  One-Way  Throughput  (S) 


Number 

of 

Nodes,  N 

Chain 

Ring 

L-2 

Star 

Number  of  Legs,  L 

L-3  L-4  L-5 

1 

1.000 

1.000 

.167 

.103 

.074  .058 

2 

.500 

.500 

.111 

.076 

.057 

3 

.167 

.167 

.097 

.072 

.055 

4 

.128 

.073 

.092 

.070 

.054 

5 

.111 

.100 

.069 

.054  .044 

6 

.102 

.083 

7 

.097 

.087 

8 

.094 

.085 

9 

.092 

.086 

10 

.091 

.086 

00 

.086 

.086 

.086 

.069 

.054  .044 

Table  II 


( 

Maximum  One-Way  Total  Throughput  of  the 
Central  Node  in  a  Large  Star  Arrangement 


Maximum  Throughput,  LS 
Center  Arrangement 

Number  of  Legs  Unconnected  Ring-Connected  Fully  Connected 


1 

.086 

2 

.172 

3 

.207 

.198 

.198 

4 

.216 

.228 

.216 

5 

.220 

.230 

.230 

6 

.220 

.240 

7 

.245 

8 

.248 

9 


.252 
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ABSTRACT 

We  consider  the  problem  of  obtaining  exact 
expressions  for  throughput  and  blocking  proba¬ 
bilities  in  multihop  packet  ratio  networks  operat¬ 
ing  under  CSMA.  We  obtain  exact  results  for 
exponential  message  lengths  for  general  topologies 
and  for  constant  packet  lengths  for  simple  topol¬ 
ogies.  Both  results  are  obtained  by  assuming 
perfect  acknowledgments . 


r.  INTRODUCTION 

We  consider  the  problem  of  obtaining  exact 
expressions  for  throughput  and  blocking  proba¬ 
bilities  in  multihop  packet  radio  networks  operat¬ 
ing  under  carrier  sense  multiple  access  (CSMA). 
By  considering  exponential  packet  lengths,  proce¬ 
dures  are  developed  which  can  be  used  to  analyze 
general  topxilogies.  Examples  of  chains,  rings, 
and  stars  are  presented.  Extensions  to  non¬ 
exponential  and  fixed  packet  lengths,  and  more 
complex  models  are  discussed . 


11.  THE  NETWORK  MODEL 

We  consider  the  problem  of  analyzing  the 
throughput  capability  of  a  multihop  packet  radio 
network  operating  under  carrier  sense  multiple 
access  (CSMA).  Thus,  we  assume  that  the  net¬ 
work  is  comprised  of  terminals  equipped  with 
radio  transponders  suitable  for  broadcasting  data 
over  a  limited  distance,  in  general,  the  source 
and  destination  terminals  cannot  hear  each  other 
directly,  and  the  data  has  to  be  relayed  by  one 
or  more  intermediate  devices.  A  separate  set  of 
devices,  called  repeaters,  may  exist  for  this 
purpose,  or  the  terminals  themselves  may  relay 
messages  for  one  another. 

Control  of  the  network  is  completely  distri¬ 
buted,  i.e.,  no  station  or  central  control  mechan¬ 
ism  is  assumed  to  exist.  Rather,  we  assume  that 
each  source  terminal  has  prestored  one  or  more 
routes  to  all  destinations  and  includes  all  neces¬ 
sary  routing  information  in  the  packets  it  trans¬ 
mits.  These  assumptions  are  made  merely  to 
simplify  the  presentation.  In  (act,  the  results 
presented  are  valid  for  networks  using  alternate 
routing  as  long  as  routing  changes  are  not  made 
over  short  time  intervals.  One  of  the  motivations 
for  this  study  came  from  a  consideration  of  the 


design  of  routing  procedures  for  such  networks 
It  was  necessary,  however,  to  first  develop  an 
understanding  for  the  throughput  ol  various 
topologies. 

Exogenous  traffic  is  modeled  as  independent 
Poisson  processes  arriving  at  each  source  noiic. 
with  appropriate  rates  and,  initially,  exponcniiallv 
distributed  packet  lengths.  The  topology  i.s 
specified  by  a  listing  of  which  terminals  (oi 
repeaters)  can  hear  each  other.  In  the  remainder 
we  will  not  distinguish  between  terminals  and  re¬ 
peaters  and  will  refer  to  them  collertively  as 
either  terminals  or  nodes.  In  general  the  trans¬ 
missions  of  one  terminal  can  be  heard  by  many 
other  terminals.  The  routing  will  specify  which 
terminal  is  to  repeat  the  packet,  if  necessary. 

If  two  or  more  transmissions  are  simultan¬ 
eously  heard  by  a  terminal  (called  a  "collision") 
at  least  one,  and  possibly  both,  is  "lost"  and 
must  be  retransmitted.  We  assume  retransmis¬ 
sions  are  scheduled  at  a  random  instant  in  time 
sufficiently  far  in  the  future  so  as  to  preserve 
the  Poisson  nature  of  the  combined  traffic  stream, 
which  now  consists  of  exogenous  traffic  and 
rescheduled  traffic.  For  this  study,  we  assume 
that  a  packet  can  be  retransmitted  as  many  times 
as  Is  necessary,  i.e.,  that  there  is  no  maximum 
allowable  number  of  retransmissions. 

Terminals  may  either  transmit  or  receive, 
they  cannot  do  both  simultaneously.  Before 
transmitting,  a  terminal  senses  the  channel.  I!  it 
detects  that  any  of  its  neighbors  (i.e.,  terminals 
that  it  can  hear)  are  transmitting  (by.  e.q., 
sensing  a  carrier)  it  reschedules  the  transmission 
as  for  collided  packets  above.  If  at  the  sched¬ 
uled  time  for  a  transmission,  the  terminal  is 
already  engaged  in  transmitting  a  packet,  the  new 
packet  is  also  rescheduled  as  above.  Thus  pack¬ 
ets  are  continually  rescheduled  until  they  are 
successfully  delivered  to  the  next  lenninal  on 
their  route.  We  assume  that  the  total  stream  of 
traffic  scheduled  by  any  terminal  is  a  I'oisson 
process.  Thus  includes  originating  trallic  .md 
packets  rescheduled  either  due  to  collisions  or 
due  to  the  channel  having  been  sensed  busy 
This  scheme  is  called  carrier  sense  multiple  access 
(CSMA).  The  Poisson  assumption  is  valid  lor  ihr 
assumptions  made  above  and  will  yield  accurate 
results  for  throughput.  Compromises  will  have  to 
be  made,  however,  if  an  .accurate  picture  of  tirao 
delays  is  to  be  considered 
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It  is  possible,  due  to  non-zero  propagation 
(Icl.iy,  that  collisions  of  transmissions  from  neiqh- 
bonncj  terminals  may  still  lake  place  despite  the 
i'SMA  strategy.  This  will  occur  if  a  terminal  senses 
the  channel  before  another  terminal's  transmission  is 
received.  This  effect  is  small  if  terminals  are 
reasonably  close  or  are  not  transmitting  at  high 
speed.  We  will  ignore  this  phenomena  here,  and 
assume  that  all  transmissions  are  instantly  heard  by 
iheir  neighbors. 

A  passive  acknowledgment  is  used  for  trans¬ 
mission  to  neighbors.  The  transmitting  terminal 
listens  to  the  channel  to  hear  if  a  packet  is  being 
rebruadcasl  by  a  neighbor.  If  after  a  prespecified 
lime  interval,  the  transmitting  node  does  not  hear 
the  packet  rebroadcast,  it  retransmits  the  packet. 
But  the  packet  may  have  been  successfully  received 
by  the  neighbor  even  though  the  originator  does 
not  hear  the  rebroadcast.  We  assume  that  duplicate 
packets  are  transmitted  and  are  deleted  only  at  the 
final  destination.  We  assume  an  end-lo-end  acknowl¬ 
edgment  is  returned  to  the  originator  from  the  final 
destination.  In  this  paper  we  assume  that  passive 
acknowledgments  are  always  heard  and  ignore  the 
effect  of  end-lo-end  acknowledgments.  Alternately, 
these  acknowledgements  could  have  been  added  to 
the  required  traffic. 

We  depict  the  topology  of  the  network  by  a 
graph  where  terminals  are  represented  by  nodes. 
The  nodes  are  connected  by  a  link  if  they  can  hear 
each  other's  transmissions,  i.e.,  if  they  are  neigh¬ 
bors.  A.s  an  example  see  figure  1.  Node  A  can 
hear  node  B,  but  not  node  C.  Node  B  can  hear 
both  nooes  A  and  C.  Node  C  can  hear  node  B, 
but  not  node  A.  If  node  A  is  transmitting  to  node 
B  and  node  C  begins  transmitting,  then  the  trans¬ 
mission  from  A  to  B  may  be  lost  depending  upon 
the  "capture"  assumptions  we  make.  A  conservative 
assumption  is  that  the  .A  to  B  transmission  is  lost  - 
this  i.s  known  as  zero  capture.  Alternately,  perfect 
capture  assumes  that  this  transmission  is  success¬ 
fully  received.  Half-amplitude  capture  assumes  that 
the  transmission  is  lost  if  C  dominates  A  at  B. 
This  can  happen  if  C  is  closer  to  B  than  A  is  to  B, 
or  has  a  greater  signal  strength  perceived  at  8 
than  A  has.  If  A  dominates  C,  then  the  transmis¬ 
sion  is  successful.  We  will  consider  these  different 
capture  situations  below.  Note  that  under  CSMA  if 
node  B  is  transmitting,  neither  A  or  C  is  allowed  to 
transmit. 

We  assume  that  a  routing  has  been  specified. 
This  takes  the  form  of  deciding  which  of  the  neigh¬ 
bors  are  to  rebroadcast  a  packet  from  a  particular 
source  to  a  particular  destination.  Thus  the  amount 
of  traffic  that  a  terminal  wishes  to  send  to  its 
neighbor  can  be  computed.  If  these  rebroadcast 
packets  are  scheduled  at  a  random  time  far  in  the 
lulure  the  I'ois.son  assumption  for  traffic  streams  is 
preserved.  We  assume  that  the  traffic  between 
neighbors  is  specified  and  form  independent  Poisson 
processes.  When  considering  exponentially  distri¬ 
buted  packet  lengths  we  assume  that  the  packet 
length  IS  reassigned  independently  at  each  hop. 
Although  this  leads  to  some  anamolies  in  interpret¬ 


ing  the  results  it  is  a  key  assumption  in  the 
model  and  is  analogous  to  the  "independence 
assumption"  in  queuing  networks. 

The  details  of  CSMA  for  a  single  hop  net¬ 
work  ran  lie  found  m  the  papers  by  Tobagi  and 

K.leinrock^ '  ^ .  Tobagi  has  also  developed  some 

(2) 

simple  models  for  two-hop  networks'  .  Details 
of  a  packet  radio  network  can  be  found  in  a 

paper  by  Kahn'^V  A  discussion  of  routing  in 
multihop  packet  radio  can  be  found  in  the  paper 

by  Gitman,  Van  Slyke,  and  Frank^^\ 

111  GENERAL  RESULTS 

In  this  section  we  develop  some  expressions 
that  are  valid  for  the  packet  radio  network  we 
have  modelled  above  using  CSMA  and  with  an 
arbitrary  packet  length  distribution.  Let  i  be  a 
node,  Oj  one  of  its  neighbors,,  N.*  the  set  of  all 

the  neighbors  of  i,  and  Nj  the  set  of  all  i's 

neighbors  and  including  i.  Without  loss  of  gene¬ 
rality  assume  the  average  packet  length  at  each 
terminal  is  unity.  Let  G.  be  the  total  rate  (in 
packets/sec)  of  all  scheduled  traffic  at  node  i. 
This  includes  originating  traffic  and  all  resched¬ 
uled  traffic  and  is  assumed  to  be  Poisson. 

Node  i  is  either  busy  (transmitting)  or  idle. 
It  will  transmit  a  scheduled  packet  if  at  the 
instant  it  is  scheduled  all  nodes  in  N.  are  Idle. 

Let  A  be  a  set  of  nodes.  Let  P(A)  be  the  proba¬ 
bility  that  at  a  random  instant  all  nodes  in  A  are 
idle.  The  nodes  not  in  A  may  or  may  not  be 
idle.  Similarly  P(i),  P(i,A),  P(A,B)  are  the 
probabilities  that  i  is  idle,  i  and  nodes  in  A  are 
idle,  and  all  nodes  in  A  and  B  are  idle. 

Since  traffic  is  scheduled  at  node  i  with  a 
Poisson  rale  G^,  will  be  transmitted  only  if  N.  is 

idle,  and  transmissions  have  unity  average 

length,  the  probability  that  i  is  busy  is  given  by 

l-P(i)  =  G|P(Nj)  (1) 

If  i  is  busy  then  under  CSMA,  n  must  be  idle. 
Then  since 

P(np  =  Pfn^.i)  ♦  P(nji  busy)  (l-P(i)) 
and  PCnJi  busy)  =  1 ,  we  have 

Pfnj.i)  =  P(nj)  ♦  P(i)  -  1  (2) 

Similarly ,  if  A  T  N^*, 

P(A,i)  ^  P(A)  ♦  P(i)  -  1  (3) 

letting  A  -  in  equation  (3).  and  us.  .g 
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P(Nj»)  =  P(Nj)/P(i|Nj*) 
we  get 

Equation  (4)  is  often  found  in  CSMA  literature. 

A  packet  from  i  to  n.  will  be  transmitted 
when  it  is  scheduled  if  Nj  is  idle.  During  the 
transmission  all  nodes  in  N.*  will  be  idle.  It  will 
be  successfully  received  at  n^  if  (1)  all  neighbors 
of  Oj  not  in  N.  are  idle  at  the  beginning  of  trans¬ 
mission  and  (2)  depending  upon  the  capture 
assumptions,  none  of  these  nodes  begin  transmit¬ 
ting  until  i's  transmission  is  ended.  Otherwise  a 

collision  will  occur.  Let  S.  _  be  the  rate  in 

i.n. 

packets/sec,  determined  by  the  routing  and  as¬ 
sumed  Poisson,  of  the  traffic  that  i  wishes  to 
send  to  n^.  This  is  the  required  throughput  or 

offered  traffic.  Let  G.  _  be  the  rate  of  all 

i.nj 

scheduled  traffic  from  i  to  n..  We  have  also 

assumed  that  all  these  streams  are  Poisson  and 
independent.  Of  these  Gj  packets  (per  sec- 

I  f  III 

ond),  Sj  must  be  successful.  Thus 
1 

^i.nj 

g— i  =  PCNj.N^  )P  (X)  (5) 

i.i\  i 

Let  D;  „  be  the  nodes  in  that  dominate  i  at 

i,nj  nj 

Oj.  Then  P(X)  is  the  probability  that  no  nodes 
in  D.  ^  begin  transmission  during  the  transmis¬ 
sion  of‘  the  packet  from  i  to  n^  conditioned  on 
node  I  transmitting,  all  nodes  in  N^*  being  idle 

throughout  the  transmission,  and  all  nodes  in 
N_  being  idle,  at  least,  initially.  If  D.  is 

n  j  i.nj 

empty  or  perfect  capture  is  assumed  then  P(X)=1. 


Markov  process  where  the  stales  are  identified  by 
which  nodes  are  idle  and  which  are  busy.  Let  D 
be  a  set  of  busy  nodes.  because  of  CSMA,  no 
nodes  in  l>  may  lu’  neighbors  of  each  other.  We 
shall  refer  to  such  a  sei  of  non-neiqhbors  (or 
strangers)  as  being  "strange."  Let  Q(D)  be  the 
probability  that  at  an  instant  of  time,  all  nodes  in 
D,  which  IS  strange,  are  busy,  and  all  nodes  not 
in  D,  are  idle.  Then  each  strange  set,  D,  repre¬ 
sents  a  stale  in  a  Markov  system,  and  Q(D)  is 
the  slate  probability.  The  collection,  D,  of  all 
strange  sets.  represents  all  the  states  in  the 
system.  In  particular,  the  null  set  D  =  ♦,  re¬ 
presents  the  slate  that  all  nodes  are  idle. 

Assume  the  system  is  in  state  D.  It  will 
leave  the  state  if  any  icD  stops  transmitting. 
This  happens  with  rate  =  1 .  Thus  the  transi¬ 
tion  to  state  |D-i}  occurs  with  rate  pj.  The  only 

other  way  to  leave  state  D  is  for  one  of  the  idle 
nodes  that  is  a  stranger  to  all  icD  to  begin  to 
transmit.  This  occurs  with  rate  X.  =  G..  (We 

-•  J  J 

use  p.  and  X^  for  the  moment  for  clarity).  Let 
Nq  be  the  set  of  all  neighbors  of  all  nodes  in  D. 
Then  the  transition  from  D  to  |D+j),  j^N^,  occurs 
with  rate  X^.  The  global  balance  equations  for 
this  system  are 

(IPj  ♦  IXj  )  0(D)=  2Xj  Q(D-i)t  i  p.  (3(Dtj)  (7) 

icD  itO 

where  Dcj9.  the  collectio.-'  of  all  strange  sets. 

It  is  easy  to  see  that  these  equations  are 
satisfied  by 

Q(D)  =  J  Q(D-i)  =  GjQ(D-i),  icD  (8) 
Thus 

Q(D)  =  (  n  G.)  0(*)  (9) 

icD  ' 


The  total  scheduled  traffic  at  a  node  is 
given  by 


where  we  adopt  the  convention  that  fl  Gj  =  1 . 
Summing  over  all  Dc^ff,  we  get  ica 


G; 


njcNj* 


(6) 


I  OCD) 


=  2  (n  G.)  Q(a)  =  1 
Dt^  icD 


(10) 


From  equations  (1)  through  (6)  we  wish  to  derive 
a  relation  between  the  S.  _  and  G;,  and  deter- 

1 ,  nj  1 

mine  the  maximum  Sj  ^  the  network  can  support. 

This  we  call  the  (maximum)  throughput  or  capa¬ 
city.  In  the  next  section  we  develop  this  rela¬ 
tionship  for  exponential  packet  lengths  and  arbi¬ 
trary  topologies.  Later  we  present  some  results 
lor  non-exponential  lengths  and  simple  topologies. 

IV.  EXPONENTIALLY  DISTRIBUTED  PACKET 
LENGTHS 

If  the  packet  lengths  are  exponentially 
distributed,  then  the  system  can  be  viewed  as  a 


In  the  previous  section  we  found  we  were 
interested  in  quantities  like  P(A),  where  A  is  any 
set  of  nodes,  and  P(A)  is  the  probaliility  that  all 
nodes  in  A  are  idle,  and  all  nodes  not  in  A  may 
or  may  not  be  idle.  This  can  be  found  by  sum¬ 
ming  y(D)  over  all  sets  U  that  do  not  contain 
nodes  in  A .  Thus 

2  c  ‘^i) 

P(A)  =  n.O(D)=  - icD - 

DCA^  1  (n  Gj) 

DC-e  icD 
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whcr*  IJ,  A  refers  to  nil  strnnge  sets  contained  in 
the  (omplement  of  A.  We  adopt  the  shorthand 
notation . 

SP(B)  =  i  (n  G.  )  (12) 

DCBicD  ‘ 


r.qualions  fll)  and  (S)  can  be  used  for  any 
topology  to  generate  the  solution  to  our  problem 

I  he  equations  relating  tfie  S  and.  G  can  be 

I » ti-  j 

solved  iteratively.  For  example,  equation  (5)  now 
becomes 


where  SP  refers  to  sum  of  products.  Thus 

P(A)  =  SP(A‘^)/SP(»rt  (13) 

where  is  the  set  of  all  nodes. 


S  SP(IN>N^  1)*^ 
’  •  SPW — 


As  an  example  consider  the  chain  of  four 
nodes  shown  in  Hgure  2.  We  assume  Sj2=  S2j= 

‘*’23'  ®32*  ®34'  ^43'  ^  simplicity,  and  perfect 
capture.  Also  note  that  Gj2“  Gj,  G^^"  ^4 
by  symmetry  Gj=  G^,  G2j=  G^^,  and  G22=  G22- 
Also  from  equation  (6),  G2=  G2j*  G23=  G^.  From 
equation  (5),  we  have 

J  =  P(l,2,3)  =  P(l,2,3,4) 

^21  *^23 

But 

SPM=  X  (flGj)=l+Gj*G2+G3+G4*GjG3»G2G^>GjG^ 
DcAcD 

=  1  ♦  2Gj+  2G2+  2GiG2^  Gj  =  2k 
and 

P(l,2,3,4)=l/A,  P(l,2,3)=(l+G^)M=(l+Gj)M. 
Solving,  we  get 

G2  =  Gj(2>Gj)  and  S  =  Gj(l»Gj)/A  . 
or 

S  =  Gj(l  ♦  Gj)/(1  ♦  6Gj^  7Gj^+  2Gj^). 

We  can  now  find  the  maximum  value  of  S  possible, 
the  throughput  of  the  chain,  which  is  .128, 
obtained  when  Gj=  0.71. 

We  can  also  consider  half-amplitude  or  zero 
capture.  For  example,  consider  zero  capture. 
Then  when  2  is  transmitting  to  3,  4  can  inter¬ 
fere  But  1,  2  and  3  are  idle,  and  4  is  initially 
idle.  1he  transmission  is  successful  if  2  finishes 
transmission  before  4  starts.  But  these  are 
events  occurring  in  exponential  time. 


where  by  A«^B  we  mean  the  union  of  A  and  B. 
For  perfect  capture  P(X)=1.  For  other  capture 
modes,  the  computations  for  P(X)  are  straight¬ 
forward  but  can  become  tedious  for  complex 
topologies . 

Evaluation  of  sums  of  products  in  equation 
(12)  are  made  easier  by  the  following  two  rules. 
Consider  two  sets  of  nodes  A  and  B  such  that  no 
node  in  A  can  hear  any  node'  of  B .  These  sets 
may  be  called  non-communicating.  Then 


SP(A»B)  =  SP(A)  SP(B), 

A&B  non-communicating  (IS) 

Also, 

SP(A)  =  SP(A-i)  ♦  G.Sr(A-N.),  itA  (16) 

To  prove  these  rules  just  consider  all  products. 
We  have  successfully  evaluated  many  complex 
topologies  with  these  procedures. 

There  are  other  relations  which  will  be  found 
useful  in  extending  our  model  to  more  complex 
situations.  We  prove  some  of  these  below.  Let  C 
be  a  cut,  i.e.,  a  set  of  nodes  that  divides  the 
network  into  three  parts  A,B,  and  C,  where  A 
and  B  are  non-ccmmunicating.  Let  A=A.«  A-.  B 
=  Bj+  B2  where  Aj'^A2=  Bj^B2=4.  Then^ 

P(A, ,C,B,)  SP(A-^B,)  SP(A,) 

P(Aj|C,Bj)=  p(c,Bj)  ^  SPYAfBp”  SP(Ay 

But 

P(Aj,C)  SP(A,^B)  SP(A2) 

P(Ai|C)  =  ■■?((; 5-  =  sFTA^Fr  "  gPTST" 


Thus  P(Aj|C,Bj)  =  P(Aj|C),  C  a  cut  (17) 


Thus  P(X)  =  "17(5”  ■  ■  by  symmetry.  Now 

=  P(l,2,3,4)  — unchanged. 

23  4  21 

c 

i,—  IS  a  little  harder  to  obtain,  but  is  a  straight- 
forward  calculation. 


We  also  have 

P(Aj,  Bj|C)  =  P(Aj|C)  P(BjjC),  C  a  cut  (18) 
in  particular  if  C  =  N^*,  then 


PdjNjV  B)  =  P(ijN.«),  BC  N.*^  (19) 

V  Px^mgles 

We  consider  here  three  different  topological 
structures  with  exponentially  distributed  packet 
lengths  and  perfect  capture.  We  assume  all 
^  S,  lor  all  1.  and  take  full  advantage  of 

symmetry  The  three  topologies,  .shown  in  fig¬ 
ures  .l.'l,  and  5,  are  a  chain,  a  ring,  and  a  star, 
all  with  various  lengths.  In  each  case  we  find 
the  maximum  throughput.  S.  These  are  given  in 
Table  I. 

Note  that  for  a  ring  greater  than  7  nodes 
the  maximum  throughput  is  the  same  as  that  for  a 
long  chain.  This  is  expected  since  the  congestion 
IS  now  in  the  middle  and  it  is  unimportant  whe¬ 
ther  or  not  the  chain  is  closed.  A  star  with  two 
legs  is  just  a  chain  with  N  for  the  star  replaced 
by  2N+1  for  the  chain.  Consider  the  star  config¬ 
uration  as  representing  the  center  node  (0) 
trying  to  transmit  to  some  node  or  nodes  far  away 
via  many  repeaters.  For  one  leg.  the  maximum 
rate  is  .086.  For  two  legs,  the  maximum  rate  is 
2S  or  .172.  exactly  twice.  The  results  arc  shown 
in  Table  II  where  the  throughput  of  the  center 
node  IS  given  by  LS.  We  see  from  Table  11  that 
whereas  the  throughput  doubles  for  L  =  2,  it 
increases  only  by  20%  when  L=3,  by  4-5%  further 
when  L-4.  and  by  2%  when  L=5.  Congestion  at 
the  central  node  is  limiting  its  ability  to  increase 
Its  throughput.  Additional  legs,  beyond  three, 
are  not  really  helpful.  We  are  evaluating  other 
topologies  to  find  ways  to  increase  the  throughput 
of  a  central  node  in  such  a  configuration. 

VI  Non-exponential  Packet  Lengths 

For  non-exponential  packet  lengths,  equa¬ 
tions  (1)  through  (6)  are  still  valid  but  are 

insufficient  to  solve  for  the  G;  and  the  S;  _  .  For 

1  I  .n; 

a  three  node  chain  and  a  star  with  any  numoer  of 
legs  out  N  =  1  we  have  been  able  to  prove  equa¬ 
tion  (IS)  for  arbitrary  packet  length  distribu- 
t.ons.  The  equations  for  P(A)  are  identical  in 
form  as  for  exponential  lengths  but  the  terms 
K(X)  lor  half-amplitude  and  zero  capture  are 
di'lerenl.  For  example,  for  constant  packet 

•G 

lengths  instead  of  1/(1+G)  we  now  have  e 
These  iwo  simple  topologies  can  be  completely 
an.ilyzed  lor  any  capture  mode.  Further  details 
Will  not  be  given  here. 

Wc  have  not  been  able  to  derive  a  similar 
n'ation.'hip  foi  other  topologies.  However,  we 
fed  th.if  equations  (17)  through  (19)  are  tempting 
approximations  to  try  Then  if  together  with 
ega.itionv  '!)  through  (')  they  are  sulficient  to 
.leiive  ,  i.ougr.  equations  to  find  the  throughput, 
tne  lesuiting  approximate  solution  has  some  ap- 
[xal  Wc  heve  been  able  to  do  this  for  several 
siT.p'ie  topologies  but  have  not  verified  the  ac- 
iiiiaiv  ot  the  atgiroximation  Consideration  of 
o  iicr  than  [x  iTcct  rapture  in  these  cases  becomes 
very  dillicult. 


VII  extensions  and  Conclusions 


We  have  presented  a  simple  but  fairly  realis¬ 
tic  model  of  a  multihop  packet  radio  network  and 
have  obtained  maximum  liircuglipiiis  fur  general 
topologies  and  exponential  message  lengths.  The 
procedures  are  easiest  to  implement  when  perfect 
capture  is  assumed.  Other  capture  modes  can  be 
handled  in  a  straightforward  but  somewhat  more 
tedious  manner.  We  have  assumed  perfect  recep¬ 
tion  of  acknowledgments  and  have  not  included 
additional  traffic  due  to  end-to-end  acknowledg¬ 
ments.  Some  aspects  of  acknowledgments  can  be 
included  by  increasing  the  required  traffic.  We 
are  investigating  the  effect  of  imperfect  acknowl¬ 
edgments  and  different  retransmission  strategies. 
The  model  should  still  be  usef-’  under  these 
extensions. 

We  have  been  able  to  derive  results  for 
non-exponential  packet  lengths,  but  only  for 
simple  topologies.  Our  procedure  suggests  a 
simple,  and  probably  good,  approximation  to  be 
used  for  more  complex  topologies.  However, 
imperfect  capture  becomes  a  more  serious  prob¬ 
lem.  We  are  investigating  these  extensions. 

Finally,  we  are  applying  these  techniques  to 
evaluate  different  routing  strategies  and  trans¬ 
mission  schemes  in  a  multihop  packet  radio  net¬ 
work. 
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Table  I.  Maximum  throughput  (S) 


Number 

of 

Nodes,  N  Chain 

Star 

Ring  L=2  L=3  L=4 

1 

1.000 

1.000  .167  .103  .074 

2 

.500 

.500  .111  .076  .057 

3 

.167 

.167  .097  .072  .055 

4 

.128 

.073  .092  .070  .054 

5 

.111 

.100  .069  .054 

6 

.102 

.083 

7 

.097 

.087 

0 

.094 

.085 

9 

.092 

.086 

10 

.091 

.086 

CD 

.086 

.086  .  086  .  069  .  054 

Table  11.  Maximum  Throughput  of  the  Central 
Node  in  a  Large  Star  Arrangement. 


Number  of  Legs  Maximum  Throughput 


1 

2 

3 

4 

5 


.086 

.172 

.207 

.216 

.220 


Figure  5.  A  stor  network  (L  legs) 


Figure  2.  A  four  node  choin 


Figure  4.  A  ring 
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Figure  3.  A  choin 
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Abstract 

We  consider  the  problem  of  obtaining  exact 
expressions  for  throughput  and  blocking  probabil¬ 
ities  in  multihop  packet  radio  networks  operating 
under  CSMA.  We  have  obtained  exact  results  for 
a  general  class  of  message  lengths,  for  general 
topologies,  and  for  perfect  capture.  These 
results  were  obtained  by  assuming  perfect  ack¬ 
nowledgments.  Here,  we  extend  these  results  to 
include  situations  where  acknowledgments  are  not 
always  heard.  We  also  introduce  and  analyze  a 
new  acknowledgment  protocol  that  significantly 
improves  performance. 


I ,  Introduction 

We  consider  the  problem  of  obtaining  exact 
expressions  for  throughput  and  blocking  probabil¬ 
ities  in  multihop  packet  radio  networks  operating 
under  carrier  sense  multiple  access  (CSMA). 
Procedures  have  been  developed  to  analyze  gen¬ 
eral  topologies  for  a  general  class  of  packet 

length  distribution  and  perfect  capture.*  That 
work  assumed  perfct  acknowledgements.  Here, 
we  extend  these  results  to  include  situations 
where  acknowledgments  are  not  always  heard.  We 
also  introduce  and  analyze  a  new  acknowledgment 
protocol  that  significantly  improves  performance. 

II.  The  Network  Model 

We  consider  the  problem  of  analyzing  the 
throughput  capabilty  of  a  multihop  packet  radio 
network  operating  under  carrier  sense  multiple 
access  (CSMA).  Thus,  we  assume  that  the  net¬ 
work  is  comprised  of  terminals  equipped  with 
radio  transponders  suitable  for  broadcasting  data 
over  a  limited  distance.  In  general,  the  source 
and  destination  terminals  cannot  hear  each  other 
directly,  and  thus  data  has  to  be  relayed  by  one 
or  more  intermediate  devices.  A  separate  set  of 
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devices,  called  repeaters,  may  exist  for  this  pur¬ 
pose,  or  the  terminals  themselves  may  relay  mes¬ 
sages  for  one  another. 

Control  of  the  network  is  completely  distrib¬ 
uted.  i.e.,  no  station  or  central  control  mech¬ 
anism  IS  assumed  to  exist.  Rather,  we  assume 
that  each  source  terminal  has  prestored  one  or 
more  routes  to  ail  destinations  and  includes  ail 
necessary  routing  information  in  the  packets  it 
transmits.  These  assumptions  are  made  merely  to 
simplify  the  presentation.  In  fact,  the  results 
presented  are  valid  for  networks  using  alternate 
routing  as  long  as  routing  changes  are  not  made 
over  short  time  intervals. 

Exogenous  traffic  is  modeled  as  independent 
Poisson  processes  arriving  at  each  source  node, 
with  appropriate  rates  and  packet  lengths.  The 
topology  is  specified  by  a  listing  of  which  termi¬ 
nals  (or  repeaters)  can  hear  each  other.  In  the 
remainder  we  will  not  distinguish  between  termi¬ 
nals  and  repeaters  and  will  refer  to  them  col¬ 
lectively  as  either  terminals  or  nodes.  In  general 
the  transmissions  of  one  terminal  can  be  heard  by 
many  other  terminals.  The  routing  will  specify 
which  terminal  is  to  repeat  the  packet,  if  ne¬ 
cessary. 

If  two  or  more  transmissions  are  simulta¬ 
neously  heard  by  a  teminal  (called  a  "collision") 
at  least  one,  and  possibly  both,  is  "lost"  and 
must  be  retransmitted.  We  assume  retransmis¬ 
sions  are  scheduled  at  a  random  instant  in  time 
sufficiently  far  in  the  future  so  as  to  preserve 
the  Poisson  nature  of  the  combined  scheduled 
traffic  stream,  which  now  consists  of  exogenous 
traffic  and  rescheduled  traffic.  For  this  study, 
we  assume  that  a  packet  can  be  retransmitted  as 
many  times  as  is  necessary,  i.e.,  that  there  is  no 
maximum  allowable  number  of  retransmissions. 

At  any  time,  terminals  may  either  transmit  or 
receive,  they  cannot  do  both  simultaneously. 
Before  transmitting,  a  terminal  senses  the  chan¬ 
nel.  If  it  detects  that  any  of  its  neighbors  (i.e  , 
terminals  that  it  can  hear)  are  transmitting  (by, 
eg.,  sensing  the  carrier)  It  reschedules  the 
transmission  as  for  collided  packets  above.  If  at 
the  scheduled  time  for  a  transmission,  the  termi¬ 
nal  is  already  engaged  in  transmitting  a  packet, 
the  new  packet  is  also  rescheduled  as  above 
Thus  packets  are  continually  rescheduled  until 
they  are  successfully  delivered  to  the  next  termi¬ 
nal  on  their  route.  We  assume  that  the  total 
stream  of  traffic  scheduled  by  any  terminal  is  a 
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Poisson  process.  This  includes  onqinating  traffic 
and  packets  rescheduled  either  due  to  collisions 
or  due  to  the  channel  having  been  sensed  busy 
This  scheme  is  called  carrier  sense  multiple  access 
(CSMA).  This  Poisson  assumption  is  valid  for  the 
assumptions  made  above  and  will  yield  accurate 
results  for  throughput. 

It  is  possible,  due  to  non-zero  propagation 
delay,  that  collisions  ol  transmissions  from  neigh¬ 
boring  terminals  may  still  take  place  despite  the 
CSMA  strategy.  This  will  occur  if  a  terminal 
senses  the  channel  before  another  terminal's 
transmission  is  received.  This  effect  is  small  if 
terminals  are  reasonably  close  or  are  not  trans¬ 
mitting  at  high  speed.  We  will  ignore  this  phe¬ 
nomena  here,  and  assume  that  all  transmissions 
are  instantly  heard  by  their  neighbors. 

A  passive  acknowledgement  is  used  for 
transmission  to  neighbors.  The  transmitting 
terminal  listens  to  the  channel  to  hear  it  a  packet 
is  being  rebroadcast  by  a  neighbor.  If  after  a 
prespecified  time  interval,  the  transmitting  node 
does  not  hear  the  packet  rebroadcast,  it  retrans¬ 
mits  the  packet.  But  the  packet  may  have  been 
successfully  received  by  t!.e  neighbor  even 
though  the  originator  does  not  hear  the  rebroad¬ 
cast.  Duplicate  packets  may  be  transmitted  and 
deleted  only  at  the  final  destination  or  they  may 
be  detected  and  deleted  earlier.  An  end-to-end 
acknowledgment  is  returned  to  the  originator  from 
the  final  destination.  In  our  original  work  we 
assumed  that  passive  acknowledgments  were 
always  heard  and  ignored  the  effect  of  end-to-end 
acknowledgments.  The  end-to-end  acknowledg¬ 
ments  could  easily  be  added  to  the  required 
traffic.  In  this  paper  we  consider  the  situation 
when  passive  acknowledgments  may  not  always  be 
heard.  We  also  introduce  and  analyze  a  new 
acknowledgment  protocol. 

We  depict  the  topology  of  the  network  by  a 
graph  where  terminals  are  represented  by  nodes. 
The  nodes  are  connected  by  a  link  if  they  can 
hear  each  other's  transmissions,  i.e.,  if  they  are 
neighbors.  As  an  example  see  Figure  1.  Ni^e  A 


ABC 


Figure  1 

Network  Topology 


can  hear  node  B,  but  not  node  C.  Node  B  can 
hear  both  nodes  A  and  C.  Node  r  can  hear  node 
B,  but  not  node  A.  If  node  A  la  i-ansmitting  to 
node  B  and  node  C  begins  transmitting,  then  the 
transmission  from  A  to  B  may  be  lost  depending 
upon  the  "capture"  assumptions  we  make.  .A 
conservative  assumption  is  that  the  A  to  B  trans¬ 
mission  is  lost  --  this  IS  known  as  zero  capture. 
Alternatively,  perfect  capture  assumes  that  this 
transmission  is  successfully  received.  Half- 
amplitude  capture  assumes  that  the  transmission  is 
lost  if  C  dominates  A  at  B.  I'his  can  happen  if  C 
is  closer  to  H  than  A  is  to  B,  or  has  a  qreuier 
signal  strength  perceived  at  B  than  A  has.  II  A 


dominates  C,  then  the  transmission  is  successful. 
However,  in  all  cases  of  a  collision' "we  assume  the 
later  transmission  is  lost.  Thus  if  C  'rs-^ans- 
mitting  to  B,  this  packet  is  iosi  in  all  cases.  We 
will  consider  only  perfect  capture  situations 
below.  Note  that  under  CSMA  if  node  B  is  trans¬ 
mitting  neither  A  or  C  is  allowed  to  transmit. 

We  assume  that  a  routing  has  been  specified. 
This  takes  the  form  of  deciding  which  of  the 
neighbors  are  to  rebroadcast  a  packet  from  a 
particular  source  to  a  particular  destination. 
Thus  the  amount  of  traffic  that  a  terminal  wishes 
to  send  to  its  neighbor  can  be  computed.  If 
these  rebroadcast  packets  are  scheduled  at  a 
random  time  far  in  the  future  the  Poisson  as¬ 
sumption  for  scheduled  traffic  streams  is  pre¬ 
served.  We  assume  that  the  packet  length  is 
reassigned  independently  at  each  hop  Although 
we  have  extended  the  analysis  to  include  non¬ 
exponential  packet  length  distributions,  we 
consider  only  exponential  packet  lengths  here. 
The  results  are  similar. 

The  details  of  CSMA  for  a  single  hop  net¬ 
work  can  be  found  in  the  papers  by  Tobagi  and 

Kleinrock*.  Tobagi  has  also  developed  some 

simple  models  for  two-hop  networks*.  Details  of 
a  packet  radio  network  can  be  found  in  a  paper 

by  Kahn*.  Details  of  our  analytic  procedure  were 

presented  at  ICC'80*.  More  details  can  be  found 

in  the  thesis  of  Sahin*. 


Analytic  Procedure 


In  this  section  we  review  the  analytic  pro¬ 
cedure  for  perfect  acknowledgments*.  Let  i  be  a 
node,  Oj  one  of  its  neighbors,  N.  the  set  of  all 
the  neighbors  of  i,  and  N.  the  set  of  all  i's 
neighbors,  including  i.  Let  g.  be  the  total  rate 

(in  packets/sec)  of  all  scheduled  traffic  at  node 
i.  This  includes  originating  traffic  and  all  re¬ 
scheduled  traffic  and  is  assumed  to  be  Poisson. 
Let  1/p.  be  the  average  length  of  packets  trans¬ 
mitted  by  node  i.  Let  Gj  =  ®  normalized 

rate. 

Node  i  is  either  busy  (transmitting)  or  idle. 
It  will  transmit  a  scheduled  packet  if  at  the  in¬ 
stant  it  is  scheduled  all  nodes  in  N.  are  idle.  Let 

A  be  a  set  of  nodes.  Let  P(A)  b?  the  probability 
that  at  a  random  instant  all  nodes  in  A  are  idle. 
The  nodes  not  in  A  may  or  may  not  be  idle. 
Similarly  P(i),  P(i.A),  P(A,B)  are  the  proba- 

bilties  that  node  i  is  idle,  node  i  and  nodes  i.n  A 
are  idle,  and  all  nodes  in  A  and  B  are  idle. 

A  packet  from  i  to  n.  will  be  transmitted 

when  it  is  scheduled  if  N.  is  idle.  During  the 

*  • 

transmission,  because  of  CSMA,  ail  nodes  in 
will  be  idle.  It  will  be  successfully  received  at  n^ 
if  dll  neighbors  of  Oj  not  in  Nj  are  also  idle  at 
the  beginning  of  the  transmission.  Otherwise 


a  collision  will  occur.  Let  s 


i,n. 


be  the  rate  in 


packets/sec,  determined  by  the  roulmq  ol  the 
traffic  that  i  wishe.s  to  send  to  n^.  This  is  the 

required  throughput  or  offered  traffic.  Let  q 

'"i 

be  the  rate  of  all  scheduled  traffic  from  i  to  n^. 

We  have  assumed  that  all  these  streams  are  Pois¬ 
son  and  independent.  Of 


per  second ,  s. 


."i 


these  Qj  ^  packets 
must  be  successful.  Thus 


S: 


t,n. 


•i,n. 


i,nj 

577  ) 

l.Aj  I 


where  S.  _  =  s.  /p.  and  G;  „ 
i.n  j  i.n  j  O  i.rtj 


(1) 


where  SI’  refers  to  sum  of  products.  Thus 

P{A)  =  Si’(A‘')/St'(  N)  (6) 


The  above  equations  can  bo  used  for  any 
topology  to  generate  the  ;;olution  to  our  pioblom 
The  equations  relating  the  S  ,  G  .  and  G 

can  be  solved  iteratively.  For  example,  equation 
(1)  now  becomes 


SPdNj  t  /) 


(7) 


We  have  also  given  many  rules  to  make  the  writ¬ 
ing  of  these  equations  easier*. 

IV.  Passive  Acknowledgments 


The  total  scheduled  traffic  (normalized)  at  a 
node  is  given  by 


Gi=  ^  Gi.n,  <2) 


If  the  packet  iengths  are  exponentially 
distributed,  then  the  system  can  be  viewed  as  a 
Markov  process  where  the  states  are  identified  by 
which  nodes  are  idle  and  which  are  busy.  Let  D 
be  a  set  of  busy  nodes.  Because  of  CSMA,  no 
nodes  in  D  may  be  neighbors  of  each  other.  Let 
Q(D)  be  the  probability  that  at  an  instant  of 
time,  all  nodes  in  D  are  busy,  and  all  nodes  not 
in  D  are  idle.  Then  each  set,  D,  represents  a 
state  in  a  Markov  system,  and  Q(D)  is  the  state 
probability.  In  particular,  the  null  set  D  = 
represents  the  state  that  all  nodes  are  idle. 

We  have  shown*  that 
n  Gj 

Q(D)  =  ^  (3) 

DcN  jcD 

where  N  is  the  set  of  all  nodes  and  we  consider 
n  G.  =  1. 
ie«  ' 

We  are  interested  in  quantities  like  P(A), 
where  A  is  any  set  of  nodes,  and  P(A)  is  the 
probability  that  all  nodes  in  A  are  idle,  and  all 
nodes  not  in  A  may  nor  may  not  be  idle.  This 
can  be  found  by  summing  Q(D)  over  all  sets  O 
that  do  not  contain  nodes  in  A.  Thus 


P(A)  =  I  Q(D)  = 
DcA'^ 


I  n  Gj 

DcA*^  icD 
I  fi~5j 

DcN  icD 


(4) 


where  OcA^  refers  to  all  such  sets  contained  in 
the  complement  of  A.  We  adopt  the  shorthand 
notation 


SP(B)  =  I  (  n  G^ 
DcB  icD  ' 


(5) 


In  this  section  we  extend  our  previous  work 
on  throughput  in  multihop  packet  radio  networks 
to  include  the  effects  of  passive  acknowledgments 
not  always  being  heard.  Let  node  j  be  a  neigh¬ 
bor  of  node  i.  we  have  already  seen  that 

Sj  j  =  Gj  jP(Nj|Nj)P(Nj)  =  Gj  jP(Nj,Nj)  (8) 

When  j  transmits  the  packet  to  the  next  node 
in  the  route,  i  can  hear  the  transmission,  and 
thus  receive  a  passive  acknowledgment,  it  all  ol 
its  neighbors  (except  j)  are  idle.  If  it  does  not 
hear  the  transmission,  then  after  a  suitable 
time-out  period  it  retransmits  the  packet.  Pre¬ 
viously  we  assumed  all  passive  acknowledgments 
were  heard.  Here  we  see  that  a  passive  ack¬ 
nowledgment  is  heard  with  probability  P(Nj|Nj). 

We  assume  that  the  time-out  period  is  selected  so 
that  all  nodes  have  the  opportunity  of  hearing 
exactly  one  transmission.  If  they  fail  to  hear  it. 
the  packet  is  retransmitted.  This  is  repeated 
until  an  acknowledgment  is  heard.  The  assump¬ 
tion  that  exactly  one  transmission  may  be  heard 
in  a  time-out  period  is  equivalent  to  assuming 
that  the  same  time  out  period  is  used  for  all 
nodes  and  that  it  is  long  enough  to  insure  one 
transmission . 

The  rate  Sj  ■  in  equation  (8)  includes  re¬ 
peated  packets,  because  of  failure  to  hear  ack¬ 
nowledgments.  Let  j  be  the  rate  of  packets 

for  which  passive  acknowledgments  are  heard. 
Then 


S*  .  =  q.  .  S.  . 

1.1  M 

(9) 

where 

‘^i.j  =  ■’(NjlNj)  = 

P(Nj,Nj) 

ttn;! 

(10) 

We  have  already  shown  how  to  solve  sets  of 
equations  like  (8)  for  the  Gj  ^  as  a  function  of 


the  Sj  j,  and  found  the  maximum  possible  ^ 

which  we  defined  as  the  throughput*.  For  ex¬ 
ample,  in  a  ^hain,  all  ^  -  S  fur  i  =  i  and  i  -  1 


3*5 


Here  we  see  how  to  modify  this  procedure  to 
account  for  passive  acknowledgments.  The  j 

are  determined  from  the  offered  traffic  and  the 
routing . 

We  identify  two  situations.  Packets  are 
retransmitted  if  they  are  unsuccessfully  trans¬ 
mitted  or  if  the  passive  acknowledgment  is  not 
heard.  In  the  latter  case,  a  duplicate  packet  is 
sent.  If  the  receiving  node  recognizes  the  dupli¬ 
cate  packet  and  deletes  it,  then  .  is  the  de¬ 
sired  rate  of  traffic  between  i  and  j.  If  a  simpler 
node  does  not  have  this  capability,  then  duplicate 
packets  are  retransmitted,  and  after  several  hops 
many  duplicates  exist.  In  this  latter  case,  j 

includes  these  duplicate  packets  and  must  be 
related  to  the  desired  rate  of  traffic.  We  will 
first  consider  a  chain. 

Consider  the  chain  shown  in  figure  2  con¬ 
sisting  of  n  nodes.  Assume  that  duplicate  pack¬ 
ets  are  not  detected.  We  assume  that  node  1  and 
n  wish  to  send  S  packets  per  second  to  each 
other.  All  other  nodes  are  just  repeaters.  When 
node  2  relays  a  packet  to  node  .1,  node  1  must  be 
idle,  and  all  passive  acknowledgments  are  always 
heard.  Thus  4^  2  ~  ^ 

SJ  2  =  2  =  S  (11) 

At  the  other  end,  node  n  does  not  repeat  the 
packets,  so  passive  acknowledgments  are  not 
possible.  However  every  transmission  must  be 
successful  since  there  are  no  interfering  nodes. 
Thus 

SS-l,n  =  Vl,n  <12) 

Node  i  attempts  to  transmit  all  ■  suc¬ 
cessful  packets  to  i*'l.  Thus 


V,i+1  ^i-l,i 


1  =  2 .  n-1  (13) 


Similarly,  in  the  reverse  direction,  we  have 

$♦  ,  =  S  ,  =  S 

n,n-l  n,n-l 


^2,1  ~  ®2,1 

and 


i  =  2 . n-l 


Equations  (11  to  14)  relate  Sj  j  and  Sj  j*  to  the 

desired  traffic,  S.  Note,  by  symmetry,  we  obtain 
relations  like  equations  (14)  by  replacing  1  and  j 
by  n-i+1  and  n-j+1,  respectively. 

As  an  example  we  evaluate  the  throughput  of 
a  four  node  chain.  I’lrst  we  assume  perfect 
reception  of  acknowledgments  and  use  only  equa¬ 
tion  (8)  with  all  S|  i=S.  Then 

S  =  G,2P(1,2,3)  =  G23P(1,2,3,4)  =  G.,,P(1,2,3) 


We  let  G,2  =  G,,  Gj,  ♦  Gjj  =  G*,  and  note  by 
symmetry  that  G3  =  G^  G4  -  G,,  and  P(l,2,3)  - 
P(2,3,4).  From  our  previous  work  we  have 

l+G^  1+G, 

P(l,2,3)  =  -p-  = 


P(l,2,3,4)  =  ^ 

D  =  U2Git2G2+2GiG2*Gf  =  (l*Gi  )**2G2(1*G, ) 


Thus  we  get 


Gj  =  G,  ♦  G,(1^G,)  =  G,(2*G,) 


^  ■  1*G,+2G2  "  (1*Gi)+2G,(!2+G,)  ^ 

We  can  solve  equation  (17)  for  the  maximum  value 
of  S,  the  throughput.  We  obtain  =  .128 

when  G,  =  .71. 

To  include  the  effects  of  passive  acknow¬ 
ledgments  we  use  equations  (9  to  13)  to  obtain 
for  arbitrary  length  chains. 


S12  =  S 


"  ‘^2.3‘’3,4  -  '^i,itl 
®n-l,n  “  ®n-2,n-l 


S,2*  =  S*,*  =  S 


—  ,  i=2 . n-2  (18) 


c  ♦  -  _ 2 _ 

i.in  - 

Returning  to  the  example  for  n=4  we  use 
equations  (18)  and  (10)  in  (8)  to  obtain 

S  =  G,2P(1,2,3)  =  q2iG23P(l,2,3,4) 

=  q2jG2,P(1.2,3) 

=  G,2P(1.2.3)  =  G23  =  G2,P1,2.3,4) 

(20) 

or  Gj  =  G,(l+G,)*G,(1»G,)*  =  G,(1+G,)(2^G,) 


J86 


J 


># 


The  throughput  here  is  S  =  .0'J8  when 

rriaX 

G,  =  .37,  a  reduction  of  23% 

In  Table  1  we  compare  the  effects  of  passive 
acknowledgments  for  various  length  chains.  We 
see  that  as  the  chain  becomes  long  the  through¬ 
put  vanishes.  Also  shown  in  that  table  is 

Table  1.  Effect  of  Passive  Acknowledgments 


Throughput 


Length 
of  Chain 

Perfect 
Acknowledq . 

Duplicates 

Detected 

Duplicates 
Not  Detected 

4 

.128 

.106 

1558 

5 

.111 

.083 

.069 

6 

.102 

.072 

.053 

7 

.097 

.066 

.044 

8 

.094 

.063 

.038 

9 

.092 

.061 

.034 

10 

.091 

.060 

.031 

• 

.086 

.057 

0 

ments  are  included  anti  duplicate  packets  are 
detected  and  not  transmitted  lurther. 

We  next  analyze  the  effect  of  passive  ack¬ 
nowledgments  on  throughput  in  star  netwiirks. 
We  will  consider  two  topologies  -  one  where  all 
legs  are  unconnected  from  each  other,  the  second 
where  the  nodes  closest  to  the  center  are  fully 
connected.  We  have  studied  these  networks  in 
our  previous  work  and  assumed  there  that  all 

passive  acknowledgments  were  heard.  ^  We  assume 
further  here  that  duplicate  packets  are  de¬ 
tected  and  not  retransmitted.  See  Figure  2 
for  the  configuration  of  the  star  network  with 
zero  connectivity,  L  legs,  and  K  nodes  in  each 
leg.  For  full  connectivity  the  nodes  0,l,Ktl, 
2K+1 . (L-1)K+1  are  fully  connected. 

The  equations  used  in  the  previous  section 
are  still  valid  here.  When  duplicate  packets  are 
detected,  all  >  =  S,  the  throughput  in  each 

leg.  The  end  conditions  are 


the  throughput  for  chains  when  duplicate  packets 
are  detected  and  not  transmitted  to  the  next 
node.  We  analyze  this  case  now. 

If  diiplicate  packets  are  detected  and  not 
transmitted  further,  then  Sj  j*  is  just  the  desired 

traffic  to  be  sent  from  i  to  j.  As  before  we  find 
S.  .•  from  the  offered  traffic  and  the  routing. 

^•J 

For  example,  in  a  chain  S?  ®  i  *  ^ 

i=l,...,n-l.  Thus  equations  (8  to  10)  combine 
to  give 


^i.l*  “  ""i-j 


{P(Nj,N,]2 


except  at  terminal  nodes  where  no  further  trans¬ 
mission  is  necessary  and  djj  =  1  fhis  can  be 

solved  as  before  to  yield  the  G.  ^  as  a  function  of 
the  Sj  j*  and  then  the  maximum  Sj  j*  or  through- 
put  can  be  found. 

We  present  the  analysis  for  the  four  node 
chain  analyzed  above.  Here  Sj  I  =  S 

for  i  =  1,2,3.  Again  we  use  symmetry  to  obtain 


(1-G,)  , 

S  =  - g—  =  G23 

D  =  (1+G,)*  ♦  2G2(1+G,) 


^2  1  — g- 


as  before. 


But  G2  ”  Gi  ♦  Gi(l^G\)^,  so 


Solving  we  get  S _ =  .106  when  Gi  =  .42 

max 

The  throughput  for  various  length 
chains  are  given  in  Table  1.  We  see  here  that 
there  is  a  34%  reduction  In  throughpir  for  long 
chains  when  the  effects  of  passive  acknowledg¬ 


‘^£K+L-1,(£+1)K  =  1  '  *  =  0 . L-1  (25) 

since  transmissions  to  the  end  nodes  are  always 
heard  and  passive  acknowledgments  are  not  sent. 
This  can  be  found  from  equation  (10).  We  fur¬ 
ther  assume  that 

Q  ~  1  >  ^  ”  0,...,L-1  (26) 

Transmissions  to  the  center  node  are  not  always 
heard.  Retransmissions  due  to  collisions  are 
included  in  our  analysis.  However  since  the 


Figure  2 

A  star  network  with  L  legs  and  zero  connectivity. 


2K+2  *. 
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center  node  dt'es  not  retransmit,  passive  ack¬ 
nowledgments  are  not  available.  Wt  assume  a 
perfect  end-to-end  acknowledgment  is  operating. 

Equations  8  to  10  have  been  solved  with 
perfect  passive  acknowledgments  and  with  imper¬ 
fect  passive  acknowledgments  and  duplicate  packet 
detection.  Two  topologies  have  been  studied  - 
zero  connectivity  and  full  connectivity  around  the 
center.  The  resultant  maximum  throughputs  per 
leg  are  shown  in  Tables  2  and  1  The  maximum 
throughput  from  the  center  node  is  given  in 
Table  4.  We  see  from  these  results  that  there  is 
again  a  34%  reduction  in  throughput  for  long  legs 
and  zero  connectivity.  Tor  full  connectivity  we 
find  a  43  to  46%  reduction  in  throughput  due  to 
the  failure  to  hear  passive  acknowledgments. 

From  our  previous  work  we  found  that 
except  for  short  legs,  the  fully  connected  topol¬ 
ogy  is  better  when  passive  acknowledgments  are 
not  considered.  This  is  because  inward  trans¬ 
missions  from  neighbors  of  the  center  are  not 
collided  with.  With  zero  connectivity  passive 
acknowledgments  for  transmissions  from  the  first 
ring  to  the  second  are  interfered  with  only  by 
the  center  node.  With  full  connectivity  all  other 
first  ring  nodes  add  to  this  interference.  Thus 
the  first  ring  will  have  to  retransmit  more.  In 
Table  4  we  compare  the  maximum  throughputs 
from  the  center  node  for  full  and  zero  connecti¬ 
vity  and  see  that  indeed  zero  connectivity  is 
better. 


Table  4.  Effect  of  Passive  Acknowledgments  on 
Star  Networks. 

Maximum  Throughput  from  the  Center 


Maximum  throughput 
per  Leg. 


No.  of 

No.  of 

Perfect 

Imper  fe,  i 

Legs , 

Hops  per 

Connec¬ 

Acknow¬ 

Acknow  • 

L 

leg .  K 

tivity 

ledgments 

ledqments 

5 

2 

Zero 

.225 

,  165 

>3 

.220 

.145 

6 

3 

Full 

.240 

.140 

>4 

.240 

.138 

9 

3 

Full 

.257 

.140 

>4 

.252 

.139 

V.  A  New  Protocol  for 
Passive  AcknowT^gemehts 

In  the  previous  section  we  discussed  the 
effect  of  imperfect  acknowledgments  on  the  maxi¬ 
mum  obtainable  throughput  for  various  topologies. 
We  saw  that  the  maximum  obtainable  througnpul 
was  reduced  more  than  30  percent  for  a  chain, 
and  more  than  44  percent  for  the  star  network. 
The  protocol  that  we  used  has  the  following 
disadvantages : 


Table  2:  Effects  of  Passive  Acknowledg¬ 

ments  on  Star  Networks.  Zero 
Connectivity  (Number  of  Legs, 
L=5)  -  Duplicates  Detected 


Maximum  throughput  per  Leg,  S 

Number  of - — 

Hops  Perfect  Imperfect 

per  Leg,  K  Acknowledgments  Acknowledgments 


1 

.058 

.058 

2 

.045 

.033 

3 

.044 

.0290 

4 

.044 

.0286 

6 

.044 

.0286 

Table  3;  Effects  of  Passive  Acknowledg¬ 

ments  on  Star  Networks. 

Full  Connectivity  - 
Duplicates  Detected 


Maximum  throughput 
per  Leg, 


Number  of 

Number  of 
Hops 

Perfect 

®max 

Imperfect 

Legs,  L 

P«r  ‘eg,  K 

Acknowledq . 

Acknowledq . 

6 

3 

.04 

.0233 

4 

.04 

.0230 

>5 

.04 

.0230 

9 

”3 

.0286 

.0155 

4 

.0280 

.0154 

>5 

.0230 

.0154 

1.  Each  node  has  only  one  chance  to  hear 
acknowledgments  from  its  neighbors. 

2.  Since  the  probability  of  a  node  hearing 
an  acknowledgment  from  its  neighbors  is 
significantJy  less  than  one,  some  of  the 
successful  transmissions  are  duplicate 
transmissions . 

3.  The  next  node  must  recognize  duplicate 
transmissions,  otherwise  the  actual 
throughput  will  go  to  zero  when  the 
number  of  hops  between  source  and 
destination  nodes  is  large. 

4.  Since  each  node  must  wait  until  trans¬ 
mission  of  the  packet  by  its  neighbor  n 
hear  acknowledgments  (or  that  parxet, 
the  timeout  period  is  large.  Thus, 
delay  will  be  large  too. 

We  propose  the  following  protocol.  A  node 
in  general  has  many  packets  to  transmit,  in 
addition  to  repeating  a  packet  )ust  received 
These  packets  are  intended  for  all  its  neighbors 
The  old  protocol  requires  one  to  wait  until  that 
particular  packet  is  repeated.  Here  we  enlarge 
the  header  of  every  packet  so  that  it  includes 
acknowledgment  information  for  all  recently  re¬ 
ceived  packets.  Thus  a  "passive"  acknowledg¬ 
ment  can  be  received  on  the  next  packet  trin.-.- 
mitted.  The  acknowledgment  for  a  particular 
packet  can  be  included  m  the  next  several  pack¬ 
ets,  so  there  are  several  attempts  to  hear  the 
acknowledgment.  At  the  expense  ol  increased 
header  and  more  logic  in  the  nodes,  the  effect  ol 
passive  acknowledgments  can  be  made  negligible 


The  protoiol  lh.il  we  h.ive  developed  to 
overcome  these  ptoplems  is  th.it  .1  node  recoijuizes 
j  passive  ackriowU-di-jmeut  lor  its  pacKet  hy  the 
virtual  transmist.ion  of  tn.ii  p  icket  A  pai  kei  is 
virtually  transmitted  if  its  identifier  is  tr.insmiited 
with  another  packet.  Lach  node  when  trans¬ 
mitting  a  packet  includes  not  only  the  packet's 
identifier  but  also  the  identifiers  of  previously 
received  packets.  A  packet’s  idcntilier  is  con¬ 
tained  in  the  next  m  packets.  A  node  would  then 
have  m  opportunities  to  hear  if  its  transmission 
was  received  (we  d.i  not  count  the  actual  trans¬ 
mission).  If  a  packet's  identifier  is  transmitted  m 
times  then  the  acknowledgment  for  that  packet  is 
almost  always  heard  tor  a  sufficiently  large  value 
of  m.  If  a  packet's  transmission  is  not  received, 
then  after  a  timeout  period  that  packet  is  re¬ 
scheduled  for  transmission 

VI.  The  Effect  of  the  New  Protocol 
on  Throughput 


Inserting  n  identifiers  into  a  pi.  kei  wni 
increase  a  packet  length  by  a  la<  l.,r  (see  (  i.j 


h  ed  efnen  b 


(2.  ) 


The  value  of  n  is  related  to  m  as  will  be  shown 
later  Here  n  is  the  number  of  diiferent  idiTiti- 
tiers  added  to  the  packet  for  the  moment  we 
will  assume  that  the  aver  age  value  .if  n  e.iu.ii  ,  ii 

I'he  value  of  m  is  chosen  such  th.ii  the 
probability  of  hearing  an  acknowledgment,  is 

close  to  one.  Since  every  node  hears  acknowl¬ 
edgments  from  its  neighbors  with  probability  close 
to  one  there  are  almost  no  duplicate  transmis¬ 
sions.  Thus  nodes  do  not  have  to  recognize 
duplicate  transmissions  for  the  sake  of  through¬ 
put. 

If  each  node  virtually  transmits  a  packet  m 
times  then  the  probability  of  successfully  hearing 
an  acknowledgment  can  be  recomputed  as  follows: 


We  develop  and  analyze  a  new  acknowledg¬ 
ment  protocol  which  we  refer  to  as  the  virtual 
acknowledgment  protocol.  In  this  protocol,  if 
node  j  hears  a  packet  transmitted  to  it  by  node  i, 
node  j  includes  the  identifier  of  this  packet  in  the 
next  m  packets  it  transmits  to  any  node  (in 

addition,  of  course,  to  later  transmitting  the 
packet  itself).  We  refer  to  this  as  m  virtual 
transmissions  (and  one  actual  transmission)  of  the 
packet.  Node  i  then  has  m  opportunities,  rather 
than  one,  to  hear  that  node  j  received  its  packet. 
Thus  the  probability  of  an  acknowledgment  being 
heard  is  greatly  increased  over  what  it  was  with 
passive  acknowledgments.  This  increases 
throughput  significantly,  roughly  50%  for  the 
topologies  we  have  studied. 

In  addition  to  this,  node  i  can  make  the 

determination  as  to  whether  or  not  to  transmit  as 
soon  as  it  hears  any  packet  transmitted  by  node 
j.  Thus,  node  i  does  not  have  to  wait  for  its 
packet  to  be  scheduled  for  retransmission  and 
retransmitted  by  node  j.  This  reduces  delay 
significantly . 

We  present  the  analysis  of  throughput  and 
overhead  due  to  the  increased  length  of  the 
header  and  give  a  general  procedure  for  com¬ 

puting  these  quantities.  We  also  present  analyses 
of  several  specific  topologies  of  interest  and 

compare  the  performance  of  this  protocol  with  that 
of  passive  acknowledgments  for  these  topologies. 


h  bits  b  bits  d  bits 


Header 

ID 

Data 

1 

2 

(n+l) 

Header 

ID 

ID 

. . .  ID 

Data 

h  bits  (n'tl)b  bits  d  bits 
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Packet  Formats  for  the  Old  and  New  Protocols 


Let  be  the  probability  of  hearing  an  ack¬ 
nowledgment.  Then  the  probability  of  not  hearing 
an  acknowledgment  after  m  virtual  transmissions  is 


q*  =  1  -  (l-qjj)'"  (23) 

For  large  values  of  m,  q.-  goes  lo  1.  When 

* 

q.j  =  1  the  throughput  (in  bits/sec)  will  increase 
to  the  results  for  perfect  capture. 


q.j  is  substituted  for  q|j  in  all  throughput 

equations  for  chain  and  star  networks  which  we 
presented  in  the  previous  sections.  The  maximum 
obtainable  throughput  for  different  topologies  .ire 
given  in  Tables  5  and  6.  The  probability  of 


Table  5. 

Effect  of  imperfect  acknowledgments  on 
for  the  chain  network  (Sj-  =  S.^  =  S). 

the 

throughput 

as  a 

function  of 

Fhroughput  of  a  chain 

Length 

number 

of  virtual 

transmissions,  m 

Perfect 

of  Chain 

Acknowl¬ 

(N) 

1 

2 

3 

4 

5 

6 

7 

8 

edgment 

4 

.  106 

.119 

124 

.127 

127 

.1275 

.1276 

128 

.128 

5 

.083 

.1 

.106 

.108 

.11 

.1105 

.1108 

.1109 

111 

6 

.072 

.09 

.096 

.  1 

.101 

.102 

.  102 

.  102 

102 

7 

.  06t) 

.085 

.091 

.095 

.096 

.097 

.097 

.097 

097 

8 

063 

081 

.088 

.091 

.093 

.094 

.034 

091 

.094 

10 

06 

.078 

,085 

(188 

.09 

.09 

.0903 

.0905 

091 

389 


Table  6.  effect  of  imperlert  .irknowlpdgrtM'nis  on 
the  throughput  ds  i  iunction  ol  m  lor 
five  legs  in  a  star  network  with  ?ero 
connectivity  (S  =  S  =  S). 


Throughput  of  a  star 
with  five  legs 
number  of  virtual 
transmissions,  m 


Length 

Perfect 

Acknow¬ 

ledgment 

of  Legs 

(K)  123 

4 

5  6 

1  .058  .058 

058 

.058 

2  .033  .039  .042 

.043 

.044  .0445 

.045 

3  .029  .037  .04 

.042 

.043  .0433 

.044 

>  4  .029  .037  .04 

.0415 

.042  .043 

.044 

hearing  acknowledgments  is  also  given  in  Table  7, 
We  see  from  Table  7  that  tne  probability  ol 
hearing  acknowledgments  approaches  1  when 
m  =  5  for  zero  connectivity.  The  throughput 
almost  increases  to  the  values  of  perfect  acknowi 
edgment  for  m  =  5. 

We  compared  the  two  protocols  for  acknowl¬ 
edgments  for  chain  and  star  networks.  The 
throughput  of  the  center  node  increased  by  SOf, 
for  a  ten  node  chain,  by  48%  for  zero  connec¬ 
tivity,  and  by  more  than  50%  for  full  connec¬ 
tivity. 

As  we  mentioned  before  the  length  of  a 
packet  is  increased  by  a  factor 

r  _  h  ♦  d  ♦  (m+1)  •  b 
f - -R  T  J"*  b - 

because  of  the  virtual  transmissions.  We  inserted 
m  ■  b  extra  bits  into  a  packet.  Thus,  the 
throughput  is  given  by 

S,|  =  G,|  .  P(N,,  »,)  ■  q,*  ■  h  ,  S  :  .  1, 

(29) 

If  we  choose  b=12,  h=84,  m  =  5,  and  d  =  960 
bits  then 


h  ♦  d  ♦  b 
h  ♦  d  ♦  (m+1)  •  b 


946 


or 


m  •  b 
h  ♦  d  ♦  b 


057 


This  means  that  the  throughput  decreased  6%  due 
to  increased  header  when  compared  with  perfect 
acknowledgments.  But  we  know  that  the 
throughput  increased  much  more  than  (>%,  because 
of  the  new  protocol.  The  results  in  these  tables 
were  obtained  by  using 

Sjj  =  Gjj  •  P(N..  Nj)  •  q*  (30) 

In  order  to  incorporate  Eg,  (29),  the  results 
in  Tables  5  and  b  are  divided  by  Eg.  (27). 


Table  7  Probability  of  hearing  arknnwiednment 
(q.j)  as  a  function  of  m  tor  tive-letjs 

of  length  5  in  a  star  network  with 
zero  connectivity. 

number  of  virtual  transmissions,  m 


3  4  5  6  7 


dot  .530  .700  .810  .86  .91  .94  .95 
q,2  .670  .890  .960  .990  .997  .999  1 
q23  .890  .980  .990  .999  1 

gaq  .910  .990  .994  .999  1 
I  1  I  1 

q,o  .910  .990  .999  1 

da,  .930  .990  .999  1 

dsa  990  .990  .999  1 

do  .970  .999  1 

qs«  1 


8  10 
97  .99 


Vll.  Optimum  m  for  the  New  Protocol 

In  the  previous  section  we  have  studied  the 
effect  of  the  new  protocol  on  throughput.  We 
found  that  for  m  >4  the  star  network  with  full 
connectivity  retains  its  superiority  with  respect  to 
the  her  connectivities.  In  this  section  we  will 
more  closely  study  the  L-leg  star  network  with 
full  connectivity.  , 

We  know  that  increasing  m  will  increase  qj. 

and  hence  throughput.  There  is,  however,  a 
tradeoff  between  the  throughput  and  overhead 
(due  to  the  increase  in  the  packet's  length)  due 
to  virtual  transmissions.  For  several  cases  we 
find  the  optimum  m.  First  we  assume  that  at  all 
nodes,  if  a  packet's  identifier  is  transmitted  vir¬ 
tually  m  times,  then  we  also  insert  m  identifiers 
of  received  packets  into  the  header  of  a  packet 
which  is  ready  for  transmission.  The  next  m 
transmissions,  after  receipt  of  the  packet  in 
question  will  include  the  identifier  of  that  packet. 

For  this  case  the  throughput  is  given  by 

Sq  *  °il  ■  ’’‘"l-  "l*  •  “lil  ■  h  ♦  d  1  Ul"  ■  b 

(31) 


where  j  is  the  one  of  i’s  neighbors. 

The  results  using  Eq.  (31)  were  computed 
for  the  9-leg-star  network  for  various  values  of  m 
when  k  =  4.  The  optimum  m  is  10  for  this  case. 

In  Eq.  (27)  we  assume  that  the  increment  in 
a  packet's  length  is  m  •  b  bits.  But  we  know 
that  the  hotspot  (node  0)  and  nodes  at  the  end  of 
the  legs  do  not  have  to  send  passive  acknowledg¬ 
ments  because  the  transmissions  into  the  hot.spot 
and  nodes  at  the  end  of  the  legs  will  never  have 
collisions.  Thus  there  is  no  increment  in  the 
packet’s  length  for  these  nodes.  Equation  (31) 
can  be  modified  to  obtain  exact  results  as  follows 

Again  we  assume  Ih,  t  node  0  sends  S  units 
of  traffic  in  each  direction.  Then  the  nodes  that 
have  m  •  b  bits  increment  in  their  packet's  length 
must  send  I  •  S  units  of  traffic  to  their  neigh¬ 
bors  where 


-4 


)90 


h  *  d  ♦  (m*l  )h 
h  ♦  d  ♦  b 


For  left  to  right  transmissions  the  through¬ 
put  equations  are: 

Soi  =  S  =  Goi  •  P(No,  N,) 

Sj.i.l  =  '  ■  S  =  ■  P(Ni.  N.,^) 

di  for  1  =  1,2 .  K-1 

For  right  to  left  transmissions:  (32) 


’k,k-l 


=  S  =  G 


k,k-l 


P(N. 


'k-1' 


Sin.i  =  •  •  Ni*i) 

•  q*^i  ,  for  i  =  0,1 .  k-2 

We  expect  that  the  results  from  Eq.  (32)  are 
greater  than  the  results  from  Eq.  (31)  because 
Eq.  (31)  is  solved  assuming  that  the  packet's 
length  is  incremented  by  m  •  b  bits  at  all  nodes. 

The  results  for  Eq.  (32)  were  computed.  We 
found  that  the  optimum  m  is  11  and  the  total 
throughput  of  the  hotspot,  S,j,,  is  .2222.  When 

m  >  11  the  throughput  decreases  because  of  over¬ 
head  due  to  virtual  transmissions. 

We  know  that  q.,  =  1  when  m  =  4  for  all  i 

*  M 

and  j  except  q,2.  This  means  that  nodes 

1,3,4,...,  k-1  need  transmit  a  packet's  identifier 
only  four  times.  Let  us  assume  that  node  2 
transmits  m  times  and  all  the  other  nodes  transmit 
m  times.  Then  the  packet's  length  is  incremented 
by 


Ij  =  m  -  b  bits  for  i  =  1,3,4,...,  k-1 

I2  =  m  •  b  bits  (33) 

'k  ■  '2k  ■  •  •  •  “  *LK  *  ° 

With  these  assumptions  Eqs.  (32)  become 

for  left  to  right  transmissions 

S  =  Soi  =  Goi  •  P(No,  N,) 


I  •  S^=G.  j,,  •  PfNj,  N.,^) 

•  <i.l'  ‘  =  1-3,4 .  k-1^ 

r  •  S  =  G23  •  P(N2,  N3)  •  q23 

for  right  to  left  transmissions  we  have:  (34) 


I  •  S^=Gi,^  .  .  P(Ni,  N.,,) 

•  i  =  0,1, 2, 3 .  k-1 

r  •  s  -  G2,  •  P(N,,  N2)  ■  qli 


S  =  S^,k-1  =  Gk,k-1  ■  \-ll 

where 


_  h  ♦  d  ♦  (m*l )b 
h  ♦  d  ♦  b 


and 


I  •  -  h  ♦  d  ♦  (m'»l)-b 
h  ♦  d  ♦  b 


(35) 


(36) 


For  h  =  84,  b  =  12,  d  =  960,  and  m  =  4  the 
optimum  m'  is  19,  and  the  total  throughout 
of  the  hotspct,  S—,  is  .2390.  With  m  =  4  and 

$  A 

m'  =  19  all  q.j  =  1,  therefore,  the  nodes  will  not 

send  any  significant  number  of  duplicate  trans¬ 
missions  to  the  next  hop.  Since  for  node  2, 

m'  =  19,  the  increment  in  a  packet's  length  there 
is  22%  (228  bits).  For  nodes  1  and  3,  the  incre¬ 
ment  in  the  packet's  length  is  4.5%  (48  bits). 

We  can  further  increase  the  throughput  by 
reducing  overhead  as  follows: 

A  node,  i,  receives  units  of  the 

successful  transmission  (throughput)  from  its 
neighbors  where 


’R(i) 


I  Sjj  (37) 

»  )* 
jcN. 

For  each  unit  of  successful  transmission  node  i 
sends  m.  passive  acknowledgments  so  that  the 
total  number  of  acknowledgments  at  node  i  is 
m.. 


’R(i) 


per  unit  of  time.  But  node  i 


transmits  with  a  rate  of  G, 


that  G. 


P(Ni)  i 


j  P(Nj)  and  we  know 
Therefore  each  trans- 


Russion  must  carry  Cj  passive  acknowledgments 
where 


(38) 


The  overhead  in  a  packet's  length  at  node  i 
is  given  by 


1.  =  rj  •  b  ,  for  i  =  0,1,2,...,  k. 

If  we  assume  that  the  hotspot  (node  0)  and 
nodes  at  the  end  of  each  path  (leg)  want  to  send 
S  packets/unit  time  then  the  throughput  is  given 
by 


h  ♦  d  ♦  rj  •  b  , 

Sij  =  -Hi  .  •  S  =  Gij  •  P(N.,  Nj)  .  q.j 

(39) 

for  i  =  0,l .  k,  and  where  j  is  a  neighbor  of 

node  i. 

For  instance  for  the  transmissions  from  left 
to  right  we  have: 

when  i  =  0 

&01  “  S  =  Gqj  •  P(N(),  Ni)  •  qoi 
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i  =  1 


Table  8. 


Total  throughput  of  the  hotspot 
m  a  star  network  (m  =  4,  L  =  9, 

K  =  4)  for  different  values  of  d. 


•4 


Sqi  *  Sji 

h  .  d  ♦  m  •  •  b 

S,2  = - -  - 

h  ♦  d  ♦  b 

■  S  =  G,2  •  P(N,,  Nj)  •  qts 

i  =  2 

Sij  ♦  S32 

h  ^  d  t  m-  (  gT-:-^,))  •  b 

1>23 - —  ~ 

h  ♦  d  ♦  b 

•  S  =  G23  •  P(N2,  N3)  •  q23 
i  =  k-1 


d 

m ' 

St 

%  Reduction 
from  perfect 
acknowledgments 

50 

16 

.2074 

18 

100 

20 

.2173 

14 

200 

15 

.2243 

11 

400 

19 

.2355 

7 

800 

27 

.2440 

3 

1600 

35 

.2489 

1 

VIII.  Conclusions 


h  ♦  d  ♦  m 


,^k-2.k-l  *  ®k,k-l  . 


•  b 


’k-l,k 


h  ♦  d 


S  ^  ^k-1 


P(N 


k-r 


Nk)  •  q 


k-l,k 


Equation  (39)  was  used  to  compute  through¬ 
puts  for  m  =  4,  t.  =  9  and  various  m'  in  a  star 
network.  We  found  that  the  total  throughput  of 

the  hotspot,  S,j,.  is  .2454  and  the  optimum  m'  is 

27.  Even  though  node  2  transmits  a  packet's 
identifier  27  times  the  increment  in  the  packet's 
length  is  only  78  bits  which  is  a  7%  increment. 
This  means  that  at  node  2  each  transmission 
carries  6.5  identifiers  (r2  =  6.5).  We  also  found 

the  ri  =  3.79  and  r3  =  3.82  which  are  4%  incre¬ 
ments.  Previously  we  saw  that  there  was  a  46% 
reduction  in  throughput  due  to  imperfect  acknow¬ 
ledgments  when  we  used  the  old  protocol  (no 
virtual  transmissions).  Here  we  sec  that  with  the 
new  protocoi  there  is  only  a  2.6%  reduction  in 
throughput  with  respect  to  the  result  for  perfect 
acknowledgments  (.252).  Up  to  this  point  we 
assumed  that  the  original  length  of  a  packet  is 
1056  bits  (header  =  96,  data  =  960). 

The  effect  of  the  overhead  depends  also  on 
the  length  of  the  data  portion  of  a  packet.  If 
the  data  portion  of  a  packet,  d,  is  small  the 
overhead  will  be  large  and  the  throughput  will  be 
reduced.  We  have  solved  Eq.  (39)  for  different 
values  of  d.  The  results  are  given  in  Table  8. 
In  Table  8  we  also  tabulate  the  reduction  from 
the  result  lor  perfect  acknowledgments.  We  see 
from  Table  8  that  when  d  increases  the  through¬ 
put  also  increases  and  the  overhead  gels  smaller. 
When  d  =  50  bits  the  overhead  at  node  two  is  36% 
but  when  d  =  2000  the  overhead  is  only  4%.  The 
reduction  in  throughput  from  the  result  of  perfect 
acknowledgment  is  18%  when  d  =  50  but  it  is  only 
.9%  when  d  =  2000 


We  have  shown  that  the  performance  of  a 
multihop  packet  radio  system  is  significantly 
reduced  by  the  use  of  passive  acknowledgments. 
We  developed  and  analyzed  a  virtual  acknowledg¬ 
ment  protocol  which  results  in  near-perfect  ack¬ 
nowledgments  with  very  little  overhead.  Thus  we 
are  able  to  overcome  most  of  the  difficulty  due  to 
imperfect  passive  acknowledgments.  We  believe 
that  this  virtual  acknowledgment  scheme  will  also 
have  a  favorable  impact  on  time  delay  and  are 
currently  in  the  process  of  analyzing  this  prob¬ 
lem. 
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ABSTRACT 

Multihop  packet  radio  networks  operating 
under  CSHA  with  perfect  capture  have  been  ana¬ 
lyzed  with  an  exact  Markov  procedure  for  expo¬ 
nentially  distributed  packet  lengths.  In  this 
paper  we  generalize  this  procedure  and  improve 
its  computational  complexity.  We  first  prove 
that  the  exact  expressions  of  the  Markov  model 
apply  for  arbitrary  packet  length  distributions 
with  rational  Laplace  transforms.  The  analysis 
is  shown  to  depend  on  mean  values  only,  and  it  is 
extended  to  networks  where  the  average  packet 
lengths  depend  upon  the  destination.  Finally,  it 
is  demonstrated  how  certain  properties  of  the 
analytic  expressions  permit  Che  decomposition  of 
large  networks  into  computationally  manageable 
segments . 

I.  INTRODUCTION 

Packet  radio  networks  consist  of  geographi¬ 
cally  dispersed  Packet  Radio  Units  (PRU's)  broad¬ 
casting  data  over  a  limited  distance.  In  case 
two  units  cannot  have  a  direct  connection,  inter¬ 
mediate  PRU's  will  act  as  relays,  thus  creating  a 
multihop  communication  network.  This  is  repre¬ 
sented  by  a  graph  with  nodes  as  sources,  desti¬ 
nations,  and  relays,  and  arcs  connecting  nodes 
which  are  within  each  other's  range.  Several 
technologies  and  routing  protocols  have  been 
proposed,  e.g.,  [1]  and  [2],  in  order  to  estab¬ 
lish  reliable  end-to-end  paths  for  maximum  net¬ 
work  throughput.  The  need  to  evaluate  different 
protocols  and  routing  procedures  initiated  sev¬ 
eral  analytic  and  simulation  studies  on  packet 
radio  network  models.  In  [3]  Kleinrock  and 
Tobagi  analyzed  single-hop  centralized  networks 
employing  Carrier  Sense  Multiple  Access  (CSHA). 
In  this  context  they  demonstrated  the  effect  of 
propagation  delay  and  of  hidden  terminals  (PRU's 
transmitting  to  the  central  hub  but  not  listening 
to  each  other).  In  [4]  Tobagi  considered  a 
sisiple  finite  state  model  for  two-hop  networks. 
In  [5]  and  [6]  multihop  packet  radio  networks 
using  a  slotted  ALOHA  protocol  were  analyzed.  In 
[7]  Boorstyn  and  Kershenbaum  introduced  an  ana¬ 
lytic  procedure  under  which  multihop  CSMA  net¬ 
works  with  perfect  capture  can  be  analyzed 
efficiently  in  terms  of  their  throughput  perfor¬ 
mance.  This  technique  yields  exact  results  if 
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certain  assuaiptions  are  made,  the  most  important 
of  which  implies  a  Markov  property  as  the  under¬ 
lying  process.  Namely,  scheduled  packet  trans¬ 
missions  are  assumed  to  form  independent  Poisson 
processes  at  each  node  and  packet  lengths  are 
exponentially  distributed.  (A  discussion  of  how 
these  assumptions  relate  to  a  more  realistic  model 
is  given  in  [7].  The  most  critical  of  these 
assumptions  is  a  long  rescheduling  time.)  This 
technique  was  used  to  evaluate  deployments  exhib¬ 
iting  various  degrees  of  connectivity  and  was 
extended  to  investigate  passive  and  active  ac¬ 
knowledgment  protocols  [8]. 

In  this  paper  we  prove  that  the  Markov  anal¬ 
ysis  presented  in  [4]  holds  for  arbitrary  packet 
length  distribution  thus  relaxing  the  exponential 
length  assumption.  Similarly  it  is  deawnstrated 
that  PRU's  transmitting  (or  relaying)  different 
length  packets  to  their  neighbors  can  be  easily 
handled  via  the  same  technique.  These  results 
prove  to  be  insensitive  to  the  packet  length 
distribution  under  the  perfect  capture  assumption. 
Finally,  it  is  shown  how  certain  properties  of  the 
probabilirv  measure  on  node  sets  decompose  complex 
topologies  and  accelerate  the  computational 
procedure . 

II.  THE  NETWORK  MODEL 

We  assume  that  a  large  number  of  PRU's, 
dispersed  within  a  geographical  area,  transmit 
packets  of  data  addressed  to  some  destination 
nodes.  Due  to  the  broadcast  mode  of  transmission, 
all  neighboring  node  PRU's  within  the  transmit¬ 
ter's  range  bear  the  packet  and  depending  on  its 
address,  either  discard  it  or  relay  it  until  it 
reaches  its  eventual  destination.  This  deployment 
is  represented  by  a  graph  whereby  neighboring 
PRU's  are  connected  with  an  arc.  All  units  trans¬ 
mit  and  receive  on  the  same  channel  via  a  Carrier 
Sense  Multiple  Access  (CSMA)  protocol.  Before 
initiating  a  transmisaion,  a  source  (or  relay) 
checks  whether  all  its  neighbors  are  idle.  A 
collision  may,  however,  result  due  to  "hidden" 
terminals  outside  the  range  of  a  PRU  initiating 
transmission  but  interfering  with  the  receiving 
node  (the  receiver  is  a  common  neighbor  to  two 
non-neighboring  transmitters).  Collisions  are 
detected  via  a  perfect  acknowledgment  protocol 
whereby  acknowledgments  are  assumed  to  be  always 
heard  and  do  not  consume  any  network  capacity. 
Packets  encountering  busy  carrier  condition  or 
suffering  collisions,  are  rescheduled  for  trans¬ 
missions  after  a  long  randomized  interval. 


A  PRU  may  not  transnit  and  receive  sioultane' 
ously.  It  can,  however,  lock  its  receiver  to  the 
first  packet  addressed  to  it  and  ignore  subse* 
quent  colliding  broadcasts.  This  is  referred  to 
as  "perfect  capture"  in  packet  radio  networks. 


III.  ANALYSES  FOR  PACKET  LENGTH  DISTRIBUTIONS 
WITH  ARBITRARY  RATIONAL  LAPLACE  TRANSFOICiS 

For  a  given  node,  i,  the  total  scheduling 
process  out  of  i  will  be  Poisson  with  rate  g 


For  networks  operating  as  above,  our  primary 
objective  is  to  assess  the  performance  of  a 
routing  protocol,  as  defined  by  relaying  deci¬ 
sions,  under  a  given  source-destination  traffic 
profile.  Formally,  given  the  network  topology 
(location  of  nodes  and  connectivity),  the  packet 
rates  and  packet  length  averages  per  source- 
destination  requirement  (end-to-end  traffic)  and 
the  routing  scheme,  we  can  easily  deduce  s.  ,  the 

cate  in  packets/sec  that  node  i  has  to  deliver 
successfully  to  its  neighbor  node  j.  Then,  we 
want  to  evaluate  g^^ ,  the  rate  at  which  node  i 

schedules  potential  packet  transmission!  to  its 
neighbor  j.  The  ratio  s../g..  denotes  the  prob¬ 
ability  of  successful  transmission  of  a  scheduled 
packet  and  provides  a  measure  for  stability  and 
delay  since  successful  packets  must  retry  after  a 
random  delay.  Extensions  to  delay  analyses  and 
effects  of  acknowledgment  schemes  are  not  con¬ 
sidered  in  this  paper. 


*i  =  ^ 

j£N. 


IJ 


(1) 


where  is  the  set  of  neighbors  of  i.  .Yode  i, 
generates  packets  with  lengths  having  a  distribu¬ 
tion  with  rational  Laplace  transform.  Let  — 
be  the  average  packet  length  from  node  i. 


Let  V  denote  the  set  of  all  nodes.  At  any 
instant  in  time,  the  system  will  be  characterized 
by  the  subset  of  nodes  D  which  are  simultaneously 
transmitting.  Obviously  nodes  in  D  may  not  be 
neighbors. 


The  main  theory  of  this  section  states  that, 
under  the  assusiptions  above,  the  steady  state 
probability  of  D,  Q(D)  is  given  by; 


Q(D)  =  Q(D-i) 


fi 

►'i 


(2) 


The  analytic  technique  described  below  will 
yield  an  exact  algorithm  for  g. .  under  the  fol¬ 
lowing  assusiptions: 


or 


Q(D)  =  Q(0)  n 
ieD 


(3) 


1)  Zero  propagation  delay  between  neighbors. 
This  implies  perfect  CSHA  and  does  not  allow 
any  collisions  other  than  those  due  to 
hidden  terminals.  It  is  a  valid  approxima¬ 
tion  for  deployments  within  reasonable 
distances. 


where 


1 


I  n 

dCV  ieD 


fi 

‘^i 


2)  Packet  lengths  are  distributed  according  to 
any  distribution  having  a  rational  Laplace 
transform.  This  class  includes  almost  all 
distributions  used  in  practice,  either 
directly  or  as  limits  from  within  the  class. 

3)  Receivers  capture  the  first  transmission 
that  reaches  them  (perfect  capture). 

4)  Perfect  acknowledgments. 


»i 

Here  we  define  H  —  =lifD=*. 
ieD  *^i 

These  results  ace  identical  with  those  reported  in 
(7]  for  exponential  packet  lengths.  Thus,  all 
results  in  [7]  apply  for  general  packet  lengths. 
Namely,  with  denoting  the  set  of  neighbors  of 

Bode  i  and  P(A)  the  probswility  that  all  the  nodes 
in  a  set  A  are  idle,  we  have 


5)  The  streams  of  scheduled  packets  at  the 
nodes  form  independent  Poisson  processes. 
This  assuaqition  is  consistent  with  Poisson 
offered  (exogenous)  traffic  and  long  random 
rescheduling  delays.  Simulation  studies  for 
single-hop  ALOHA  [9]  indicate  that  if  re¬ 
scheduling  is  delayed  more  than  10  times  the 
packet  length,  this  critical  assumption  is  a 
valid  one . 

6)  Packet  lengths  are  assumed  to  be  independ¬ 
ently  reassigned  at  each  node  in  a  path. 
This  is  consistent  with  the  rescheduling 
delay  assumptions  above. 


Pr  {successful  transmission  from  i  to  j ]  = 


s .  . 

=  P(.V.UN.) 
X  .  l  1 

*ij 

Using  the  notation 

SP(B)  A  Z  n  fi 
DCB  IeD 

we  have  from  (3)  that 


P(A) 


Z  Q(D)  = 

DC(V-A) 


SP(V-A) 

SP{V) 


(4) 


(5) 


(6) 


Subsets  D  in  (3)  and  (6)  consist  of  noo-cooBuni* 
eating  nodes  as  in  (3).  Equations  (4)  and  (6) 
fons  the  basis  of  an  iterative  procedure,  which 
along  with  definition  (1),  determines  g  .  values, 

for  given  s  .  rates,  if  they  exist,  and  diverges 


otherwise. 


ij 


D  =  Dj  U  D,  U  .  .  .  U 

Let  Nq  denote  the  set  of  neighbors  of  ail  nodes  in 

D  including  D.  We  will  make  use  of  the  following 
notation  in  order  to  write  state  transition 
(global  balance)  equations 


We  now  proceed  to  the  proof  of  formulas  (2) 
and  (3)  for  general  packet  length  distributions, 
which  are  different  for  each  node.  We  use  the 
method  of  stages  as  developed  by  Cox,  [10],  to 
decompose  arbitrary  service  distributions  to  a 
combination  of  exponential  services.  Cox  demon¬ 
strated  that  any  distribution  with  rational 
Laplace  transform,  can  be  represented  by  the 
general  Erlang-branching  configuration  of  Figure 
1.  As  in  the  figure,  a  node  i  which  initiates  a 
packet  transmission,  activates  the  first  expo¬ 
nential  server  with  rate  With  a  given  proba¬ 
bility  1-P|  the  packet  terminates  transmission  at 

this  stage,  otherwise  it  enters  a  second  expo¬ 
nential  server.  Since  only  one  packet  is  trans¬ 
mitted  at  a  time,  a  busy  node  will  correspond  to 
one  of  the  n  stages  being  active.  The  average 
packet  transmission  time  l/p^,  equals  the  average 

residency  within  the  stage  sequence  of  Figure  1. 


pi  pi 
1  2 
i 
►'3 


P*’  pi. 
1 


n-1 


(7) 


[Dj, . . .  ,Dj^-i, .  .  .  ,D^}  denotes  a  state 

identical  to  E,  with  node  i  removed 
from  the  set 

{D, , . . . ,D, +i, . . . ,D  }  similarly  denotes 
1’  ’  k  n 

the  addition  of  node  i  to  D. 


The  global  balance  equations  will  equate  flow 
into  state  E  to  flow  out  of  state  E.  There  are 
several  possible  ways  of  entering  or  exiting  a 
state.  For  example,  a  new  arrival  to  state 
{Dj-i,  D^,...,  D^}  with  rate  g^  will  yield  state 

E.  A  change  of  stage  k-1  to  k  in  (0^,  D^,  .... 

with  rate  will 

result  in  E.  Accounting  for  all  possible  transi¬ 

tions,  and  denoting  the  stationary  probability  of 
state  E  by  Q(E) ,  we  have; 

g.  Q(Dj-i,  D2,...,D^) 

*  ^  ^  Cl  “^^“l . ®k-l*^'  V' . “n’ 

k=2  ieD,  t  ‘  t  ‘  t  “ 


>  I  Z  (1-P^)  Q(Dj . D^+i,...,D_^)  = 


k=l  i<N, 


*  {  Z  wj  ♦  Z  ^  Mk  ♦  Z  g.}  Q(E)  (8) 

.r  *  i—es  _•  ..ra  ^  .r  * 


icD 


k=2  ieO, 


Fig,  t  stage  Oecameosupn  ‘or  Nods  a 


The  rational  Laplace  transform  class  includes  or 
approximates  all  distributions  found  in  practice. 
As  an  example  of  the  flexibility  of  the  method, 
note  that  a  single  exponential  server  models 
exponential  packet  lengths  and  that  a  series  of 

servers  (P^=l)  approaches  the  constant  packet 
length  case  as  the  number  n  increases. 

The  system  with  decosiposed  nodes  as  above, 
is  a  Harkov  process  with  Poisson  scheduling  and 
exponential  servers.  The  state  of  the  system 
must  contain  information  on  busy  (transmitting) 
nodes  and  their  respective  current  stage  of 
transmission.  Let  D|^  be  the  set  of  busy  PRU's  at 

stage  i.  The  network  state  E  is  the  collection 
of  subsets  The  set  D  of  busy  PRU's  regard¬ 
less  of  stage  is  given  by 


We  observe  that  the  global  balance  equations  (8) 
can  be  decomposed  info  three  sets  of  consistent 
local  balance  equations: 

g.  Q(Dj-i,D2 . D^)  =  Mj  Q(E)  Vi  e  Dj  (9) 

Cl  Cl  Q(Di.---.Vi"^'V^ . V  = 

VieD^,  2  <  k  <  n 
(10) 

Z  (1-P^)  Q(Dj . D,.i . D^)  =  g.  Q(E) 

k=l 

VifNj, 

(11) 

Equations  (9)  and  (10)  equate  the  flow  into  E  due 
to  arrivals  at  stage  k  of  node  i  to  the  flow  out 
of  E  due  to  departures  from  the  same  stage  of  the 
sasw  node.  Equations  (11)  are  obtained  by  con¬ 
sidering  an  additional  stage,  the  idle  stage 


whereby  aode  i  is  ''ile  but  peraitted  to  transait, 
1  e  Sq.  Then  we  equate  the  flow  lato  E  due  to 

arrivals  to  the  idle  stage  of  aode  i  to  the  flow 
out  of  E  due  to  departure  froa  this  stage  of  aode 
1.  Equatioa  (11)  caa  be  obtaiaed  froa  (9)  aad 
(10)  by  rearraagiag  (9)  aad  suaaiag  (9)  and  (10) 
for  all  k's.  Thus  (9)  aad  (10)  constitute  aa 
iadepeadeat  set  of  local  balaace  equatioas,  con- 
sisteat  with  (11)  aad  suaniag  up  to  the  global 
balaoce  equatioa  (8).  For  similar  reasoning  on 
local  balaoce  equations,  see  [11]  and  [12].  In 
[12]  siailar  processes  are  characterized  within 
the  context  of  Harkov  fields  for  spatial  pro¬ 
cesses  . 


From  Che  local  balaace  equations,  it  easily 
follows  that  the  stationary  state  probabilities 
Q(E)  possess  product  form  solutions,  namely 


g.  8-^i 

Q(D,,D  . D  )  =  Q(q)  n  -j  X  n 

^  ^  “  i£Dl  ieDj  Pj 


g.  p^  P*-  B.  P^.  .  .P  *■ 

X  n  — — T — ^x...x  n  — sii 


ieD, 


icD 


(12) 


If  Q(D)  denotes  the  steady-state  probability  that 
the  nodes  in  D  are  busy  regardless  of  the  stage 


of  transmission, 

Q(D)  will 

consist  of  the 

sum  of 

all  states 

{Dj,D 

'2 . “n> 

which 

include 

a  node 

ieO  at  soate 

stage 

k.  ieD,. 

Thus 

Q(D)  = 

Z 

Q(D,.D^,. 

All 

partitions  such 

that 

D 

U  D,  U  . .  . 

U  D 

2 

n 

or 

1 

pj 

PlPl • • 

.p  ^ 

Q(D)=Q(«)  n  g. 

ieD  1 

i-  +  -i  ♦ 

1  2, 

1  ^  ^ 

n-l 

Mj 

i 

^'n 

and  from  (7) 

8. 

Q(D)  = 

Q(«) 

n  —  q.< 

ieD  ►'i 

E.d. 

Thus  the  steady  state  probabilities  of  tbe 

set  of 

busy  nodes 

has 

a  product 

form 

solution, 

,  and 

depends  on  the  average  packet  length  (possibly 
different  for  each  node)  for  packet  length  dis¬ 
tributions  with  rational  Laplace  transform. 
These  distributions  need  not  be  the  same  for  each 
node.  The  number  of  stages  n  used  in  the  proof 
corresponds  to  the  maximum  number  of  stages  in 
tbe  network.  For  nodes  with  smaller  a,  the 
branching  probability  P^  can  be  set  to  zero  to 
truncate  stages. 

IV.  MODES  TRAMSHITTIWG  PIFFEHm  PACKET 
LEMGTHS  TO  EACH  HEIGHBOR 


A  slight  modification  extends  tbe  analysis  to 
scenarios  whereby  a  node  i  transmits  packets  of 
different  average  length  to  each  of  its  neighbors 

j. 


Let  g.  .  and  1/u..  be  the  scheduling  rate  and 
ij  ij 


average  packet  length  for  the  i  to  j  transmission. 
We  keep  the  same  structure  as  before  by  breaking 
the  node  i  into  a  set  of  "micronodes ,”  one  for 
each  neighbor.  Hicronodes  are  connected  in  our 
topology  if  they  can  hear  each  other.  Obviously 
micronodes  belonging  to  the  same  node  i  are  fully 
connected  and  so  are  micronodes  belonging  to  nodes 
connected  in  our  initial  topology.  As  an  exaaiple, 
a  five  node  chain  will  be  decosq>osed  as  in 
Figure  2.  Since  nodes  2,  3  and  4  are  transmitting 
to  two  neighbors  they  are  decomposed  into  two 
micronodes.  By  applying  tbe  results  of  the 
previous  section,  we  have  that  for  a  successful 
transmission  from  2  to  3, 


^  =  P(1  2  2-  3  3-)  =  SP(4,4’.,S) 
*21  ^  ’  ’  ’  SP(V) 


where 


43 


»45 


*54 


SP(4,4’  ,5)  =  1  +  —  +  —  ♦ 


"43 


Mas 


43 


"54 


SSI  Sis  S«i 


Mow,  aefine  G,  =  —  +  —  and  G  =  —  , 

^  *^43  ^^45  5  ^^54 

in  other  words  let  G^  be  the  average  normalized 

scheduling  rate  of  node  i . 

*ii 

G.  =  Z  ^  (13) 

It  follows  that  for  this  example  results  are 
identical  as  in  tbe  original  topology  before 
decomposition  but  with  replaced  by  (13). 

In  general  consider  SP(B*)  where  B*  is  the 
set  of  micronodes  associated  with  the  set  of  nodes 
B.  Let  node  i  e  B  and  denote  its  micronodes  by  ij 
where  j  are  neighbors  of  i.  We  use  tbe  relation 
[5], 


In  tbe  previous  section,  we  assusMd  that 
each  node  i  schedules  packet  transmissions  at  an 
aggregate  rate  g.  and  with  average  length 
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SPtB)  =  SP(.B-i)  +  ^  SP(B-N.)  (14) 

For  micronodes , 

SP(B*)  =  SP(B*-{i,j})  1-  ^  SP(B*-N*.) 

Note  that  because  of  the  coonectivity  of  oicro- 

aodes ,  NA .  is  the  sane  for  all  j,  aaoely  the  set 
ij 

of  raicroaodes  of  N.,  N*.  Let  i*  be  the  set  of 
1  1 

microaodes  of  1.  Then  proceeding  recursively  we 
have 

*ii 

SP(B*)  =  SP(B*-i*)  +  Z  ^  SP(B*-lf*)  (15) 
i  ^ij  ^ 

Using  (13)  and  repeating  the  above  process  we 
find  that  equation  (6)  still  holds  for  the  origi* 
nal  nodes,  but  with  replaced  by  given  in 

(.13).  In  the  special  case  where  all  packets 
transoitted  by  i  have  average  length  l/p.,  then 

G  =  g. /u . . 

1  *1 


As  a  special  case  with  C  containing  one 
nude  i 

SP(V)  =  SP(V-i)  ♦  G.  SP(V-N^)  (20) 

Sets  N.  and  N-  above,  denote  the  sets  of 
1  D 

neighbors  of  i  and  all  nodes  in  D  respectively. 
Finally,  if  C  is  a  cut,  i.e.  a  sec  of  nodes  which 
when  renoved  decooiposes  the  network  into  two 
isolated  subsets  A  and  B,  it  follows  from  (16) 
that  for  all  independent  sets  D(]2C 

SP(V-C-Np)  -  SP(A-Njj)  SP(B-Mjj)  (19) 

and 

SP(V-C)  =  SP(A)  SP(B) 

As  an  example,  consider  the  10  node  network 
illustrated  in  Figure  3.  By  choosing  C  =  (5,6), 
the  network  is  decomposed  into  three  subsets,  A,  B 
and  C  with  A  and  B  being  totally  isolated  from 
each  other.  Applying  (17)  and  (19)  we  can  reduce 
the  coB^utations  needed  to  evaluate  SP(V)  as 
follows : 


V.  COHPUTATION  OF  SP(A) 


SP(V)=SP(1,2,3,4,5,6,7,8,9,10)= 


As  seen  from  equations  (4),  (5)  and  (6) 
evaluation  of  s../g..  requires  an  efficient  way 

of  computing  SP(A)  expressions  for  various  sets 
of  nodes.  Recall  that  SP(A)  stands  for  sums  of 
products  of  rates  G^  on  all  possible  independent 

subsets  of  A.  By  an  independent  set  we  denote  a 
set  of  nodes  which  do  not  communicate  (i.e.  are 
not  neighbors). 


=SP(1,2,3,4,7,8,9,10) 


=SP(1,2,3,4)SP(7,8,9,10) 


+  Gj  SP(1,4,9,10) 

*  Gg  SP(1,2,7,10) 

1-  GjGg  SPd.lO) 

4  Gj  SP(1,4)-SP(9,10) 


SP(A)  =  Z  n  G. 

DCA  ieD  '• 

A  straightforward  algorithm  to  evaluate 
SP(A)  would  require  identification  of  all  inde¬ 
pendent  subsets  of  A,  a  problem  dual  to  identifi¬ 
cation  of  all  cliques  in  a  graph  and  thus  NP- 
coitplete  (a  clique  is  a  fully  connected  subset  of 
a  graph).  It  is  however  possible  to  handle  con¬ 
siderable  size  networks  by  using  several  proper¬ 
ties  of  the  SP(A)  function.  We  suoiscize  some  of 
them  and  demonstrate  via  an  example  how  to  decom¬ 
pose  a  network  into  smaller  segments. 

If  two  subsets  of  nodes  A  and  B  are  isolated 
from  each  other,  then  in  [7]  it  is  shown  that 

SP(AUB)  =  SP(A)  SP(B)  (If) 

If  C  is  an  arbitrary  subset  of  V,  it  can  be 
shown  that 

SP(V)  =  SP(V-C)  +  Z  j  n  G.  •  SP(V-C-Nn)( 

DCcjttD^  M(I7)^ 


♦  Gg  SP(1,2)-SP(7,10) 

♦  GjGg  SP(1,10) 

We  can  now  proceed  to  direct  evaluation  of  sums- 
of-products  for  subsets  with  4  nodes  at-  the  most 
instead  of  10  initially. 


•  c  a 


VI.  CONCLUSIONS 


This  result  is  due  to  Mr.  W.  Chen,  of  the  Bell 
Telephone  Laboratories  and  the  Polytechnic 
Institute  of  New  Yirk  and  can  be  proven  by 
considering  all  terms  not  including  nodes  in  C 
and  then  all  terms  involving  different  inde¬ 
pendent  subsets  of  C. 


We  have  shown  how  our  work  on  an  analysis  of 
multihop  packet  radio  networks,  originally  done 
for  exponential  packet  lengths,  can  be  extended  to 
packet  length  distributions  with  rational  Laplace 
transforms.  Thus  the  results  previously  obtained 
can  be  used  in  this  more  general  case  with  only 


the  average  packet  length  being  required.  It  was 
also  shown  that  the  packet  length  distributions 
transmitted  by  a  single  node  to  different  neigh¬ 
bors  need  not  be  the  same.  Finally,  a  method  was 
described  to  simplify  the  writing  of  the  analytic 
expressions  for  large  networks. 
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B.  Multiple  Access  i  et  hniques  with  Arbitrary 

Pac^t  Length  Disttibutions 
B.l  Intrq^duction 

In  our  ofiyindl  multihop  packet  radio  analysis,  we  assumed  expo¬ 
nentially  distributed  p<icket  lengths.  We  have  been  able  to  generalize 
the  an.ilysi.s  lor  p<i(  kel  lengths  having  densities  formed  by  the  positive 
sum  ot  exponential  terms  (see  Appendix  C).  In  our  analysis  we  as¬ 
sumed  that  pr ()[)<iqation  delays  among  neighboring  PRU's  are  negligible. 
Thus  in  a  Carrier  Serving  Multiple  Access  (CSMA)  mode  of  operation, 
collisions  may  occur  due  to  the  "hidden  terminal"  phenomenon  only 
(i.e.,  two  non-communicating  PRU’s  schedule  packet  transmissions  to  a 
common  neighbor  simultaneously). 

CSMA  analyses  incorporating  the  effects  of  propagation  delays  have 
l)(‘en  reported  extensively  in  the  literature  for  single-hop  networks 
U.C.,  all  Pklcs  tic.ii  each  (hirer')  and  fixed  packet  sites.  As  a  first 
^icfj  in  generali/iny  these  results,  we  studied  single-hop  multiple  access 
protocols  with  non-tixed  packet  lengths.  Although  our  main  thrust  is 
on  CSMA  packs- 1  radio,  we  rilso  derived  formulas  for  pure  ALOHA  and 
(  SMA  with  collisiorr  detrs  tion  (CSMA/CD).  The  former  was  a  necessary 
;;tep  in  order  to  demonstrate  the  impact  of  packet  length  distribution  on 
the  sim[)lest  multiple  access  method,  whereas  the  latter  is  a  straight- 
tnr'w.ird  extension  of  pure  CSMA  and  is  especially  popular  in  local 
netwfrrking  environment;..  Nol('  that  the  pure  ALOHA  case  was  studied 
faeviously  1)|,  wher'c.rs  rro  extension  has  been  reported  on  CSMA  to  our 
knowledcp'.  Hie  i  ksMA  i  :u  result  is  so  simple  that  it  may  already  be 


known . 


In  what  follows,  we  summarize  the  variable  packet  length  analyses 
in  pure  ALOHA,  CSMA  and  CSMA/CD.  In  all  cases  we  assumed  infi¬ 
nitely  many  Poisson  sources  and  Poisson  aggregate  scheduling  pro¬ 
cesses,  with  rates  s  and  g  packets/sec  respectively.  Packet  lengths 
are  distributed  arbitrarily. 

B.2  Pure  ALOHA 

Referring  to  Figure  9,  we  consider  a  transmission  of  length  Y 
(shaded).  This  transmission  will  be  successful  if  a)  no  other  packet  is 
transmitted  in  Y  seconds  and  b)  no  previously  transmitted  packet  is 
still  transmitting.  We  are  assuming  zero  capture.  Calling  these  prob¬ 
abilities  P^  and  Pjj,  we  obtain  s  =  gP^Pj^-  But 

CD 

P  =  /  e'^^  fY(y)  dy  =  MY(-g) 

^  0 

where 

00 

MY(g)  =  ;  e^y  fY(y)dy 
0 

is  the  moment  generating  function  of  Y  and  fY(y)  is  its  density.  Pj^  is 
found  by  considering  the  T  second  interval  prior  to  the  transmission  in 
question.  Assume  transmissions  in  that  interval  occur  seconds 
before  the  start  of  our  test  transmission  and  have  length  Yj.  Then 

P.  =  lim  P(all  T.  >  Y.) 

T->oo  ‘  ~  ‘ 

But  the  number  of  transmissions  in  T  is  Poisson  and  all  are  identically 
distributed  and  independent.  Therefore 
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00 

P(all  T.  >  Y.)  =  I  [P(T.  >  Y.)]^ 

-gT[l-P(T.  >  Y^]  -gTP(Tj  <  Y.) 

=  e  =  e 

Here  T.  is  uniform  in  the  interval  (0,T)  and  Y.  is  distributed  as  Y. 
They  are  independent.  Thus 

P(T.  <  Yp  =  ^  ;  [1  -  FY(t)]  dt 

and 

TP(T.  <  Y.)  f  [1  -  Fyit)]  dt  =  E(Y) 

1  1  0  * 

Here  FY(y)  is  the  distribution  function  of  Y  and  E(Y)  its  expectation. 
Finally  we  have 

s  =  gMY(-g)  . 

Note  that  if  Y  is  fixed  then  s  =  ge"^^  as  it  should. 

But  Y  is  the  length  of  the  transmitted  packets.  Condition  (b) 
above  does  not  involve  the  length  of  the  transmitted  packet  Y.  But 
condition  (b)  does!  Longer  packets  are  more  likely  to  suffer  a  colli¬ 
sion.  Let  X  be  the  length  of  the  offered  (or  successful)  packets.  Y 
should  in  a  sense  be  larger  since  longer  packets  are  retransmitted  more 
often.  Due  to  condition  (a)  alone  a  packet  of  length  x  will  be  suc¬ 
cessfully  transmitted  with  probability  e"^^  and  requires  an  average  of 
e^^  transmissions  to  be  successful.  Thus 

fx(y)/Mj^(g). 
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and 


Pa  =  • 

Thus  _  g  ,  -gM^,(g)/M^(g) 

This  also  reduces  to  s  =  ge"^®^  when  M^^Cg)  =  e^^.  This  result  has 
been  already  established  [5]  but  is  derived  here  in  a  different  manner. 


B.3  CSMA 

Refer  to  Figure  10  for  CSMA.  After  an  idle  period  a  packet 
scheduled  with  rate  g  is  transmitted.  The  packet  lasts  for  X  seconds. 
In  the  propagation  time  a  after  transmission  any  other  scheduled  packet 
can  also  be  transmitted  thus  causing  a  collision.  We  assume  X  >  a. 
Again  we  assume  zero  capture.  If  no  such  packet  is  transmitted,  then 
the  original  transmission  is  successful,  lasts  for  X  seconds,  and  is 
followed  by  an  a  second  period  to  clear  the  channel  and  the  idle  state 
resumes.  Thus  the  successful  rate  is 


But 


s  =  ge~^^  P(channel  is  idle). 
PCchannel  is  idle)  =  ^/g  v 


where  1/g  is  the  average  idle  time  and  Z  is  the  busy  period  exclusive 
of  the  last  a  seconds. 

To  evaluate  E(Z)  we  denote  the  times  of  the  transmissions  of 
interfering  packets  as  T.  and  their  lengths  as  Xj.  The  number  of  such 
packets  is  exponential.  Collisions  depend  only  upon  the  propagation 
time  a  and  not  on  the  length  of  the  transmitted  packet  as  in  ALOHA. 


B.4  CSMA/CD 

Refer  to  Figure  11  for  CSMA/CD  (unslotted).  Here  collisions  are 
detected  and  transmissions  aborted.  Again  packets  are  scheduled  with 
a  rate  g.  After  an  idle  period  a  packet  is  transmitted.  If  no  packets 
are  transmitted  in  the  next  a  seconds  that  packet  is  successful.  After 
a  seconds  to  clear,  the  channel  returns  to  idle.  A  scheduled  packet 
can  be  transmitted  in  the  first  a  seconds  and  will  cause  a  collision. 

k# 

But  it  will  be  aborted  a  seconds  after  the  original  transmission.  The 
original  transmission  will  be  aborted  a  seconds  after  the  start  of  the 
colliding  packet.  Thus,  as  before, 

V# 


rr 
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s  =  ge  ^  P(channel  idle). 

P(c.hannel  idle)  =  i/g  ^(z) 

and  Z  =  X,  with  prob.  e"®® 

min{Tj}  +  a,  with  prob.  1  -  e”®^. 

Here  we  assume,  for  convenience,  that  X  >  2a.  Solving  we  get 
E(Z)  =  E(X)e’9^  +  +  a)  e'^^ 

=  E(X)e‘9^  ^  I  •  (l+ga)e'®®]  +  a(l  -  e'^^) 


or 


s  = 


jae 


-ag 


1  +  gE(X)e‘5^  +  (l+2ga)(l-e‘^®) 


Note  than  only  E(X)  appears  in  the  above  equation 


FIGURE  9 


PURE  ALOHA 


FIGURE  11 
CSMA/CD 
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Evaluation  of  Throughput  in  Multihop  Packet  Radio 
Networks  with  Complex  Topologies 
Aaron  Kershenbaum  and  Robert  Boorstyn 
Polytechnic  Institute  of  New  York 

In  a  previous  paper  [1]  we  developed  an  analytical  model  of 
multihop  packet  radio  networks  operating  under  CSMA  (Carrier  Sense 
Multiple  Access),  Poisson  arrivals,  perfect  acknowledgements  (i.e. 
acknowledgements  always  heard  and  not  taking  any  channel  capacity), 
perfect  capture,  and  zero  propagation  delay.  This  yields  a  Markov 
model  of  the  system  and  the  relationship 

^  =  P(N.U  NJ  (1) 

^ij  ^ 

where  s.^  is  the  required  successful  rate  (in  packets  per  unit  of 
transmission  time)  of  transmissions  from  node  i  to  node  j,  g.^  is  the 
scheduled  rate  of  traffic  from  node  i  to  node  j,  P(A)  is  the  proba¬ 
bility  that  all  nodes  in  set  A  are  idle  (and  says  nothing  about  other 
nodes),  N.  is  the  neighborhood  of  node  i  (i.e.,  node  i  together  with 
all  nodes  which  can  hear  its  transmissions),  and  N^  is  the  neighbor¬ 
hood  of  node  j.  Some  of  the  above  assumptions  about  the  network 
model  can  be  relaxed  to  include  a  more  general  class  of  networks,  but 
we  use  the  simple  model  above  for  the  sake  of  clarity.  It  is  unlikely 
that  such  generalizations  would  significantly  affect  the  computational 
procedure  described  below. 

We  are  given  r.^,  the  required  end-to-end  traffic  from  i  to  j,  for 
all  nodes  i  and  j.  We  assume  a  routing  has  been  done  yielding  hop- 
by-hop  requirements  s.^.  We  are  also  given  the  connectivity  for  each 
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node.  Equation  1  states  that  a  transmission  will  be  successfully 
received  once  it  is  scheduled  if  it  is  transmitted  and  if  no  neighbor  of 
the  receiver  is  transmitting  at  the  time  transmission  begins.  (Our 
assumption  of  perfect  capture  allows  us  to  ignore  interference  from 
any  subsequent  transmissions .  ) 

In  our  previous  work  [1]  we  have  shown  that  the  probability  of 
a  set  of  nodes.  A,  being  idle  is  given  by 

P(A)  =  2  7t  G.  /  I  n  G.  (2) 

ICA^  iel  IC  V  iel 

where  A  is  the  complement  of  A,  (i.e.  all  nodes  not  in  A),  I  is  an 
independent  set  (i.e.  nodes  which  all  cannot  hear  each  other),  and  Gj 
is  the  total  scheduled  rate  from  node  i  (i.e.  G^  is  the  sum  on  j  of 
g.j).  We  refer  to  the  quantities  in  Equation  2  as  sums  of  products  on 
a  set  A  and  denote  it  by  SP(A).  We  can  thus  rewrite  (1)  as 


9ij 


SP(V-(N.t/  N.)) 
SP(V) 


=  F..  (G) 


(3) 


where  G  =  (Gj,  G2,...Gj^)  and  Gj  =  Then  we  can  write, 


s. . 

^ij  "  F^) 


(4) 


We  then  can  iteratively  compute  new  estimates  of  the  g.^.  given 
current  estimates  of  these  quantities  and  hence  the  G. .  Our  previous 
experience,  reported  in  [1]  showed  that  if  the  network  can  support 
the  given  s^^  then  the  iteration  will  converge  if  we  start  with  ^ . 

Note  that  g..  >  s... 

-  ij 

Conceptually,  then,  the  problem  is  solved.  One  need  only  set 
up  the  expressions  for  the  Fjj(G)  and  iterate  (4)  ior  each  i  and  j 
until  successive  estimates  for  the  g.^  converge  to  within  some  given 
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tolerance.  This  was  done  for  small  networks  and  for  larger  networks 
with  symmetric  traffic  and  cor  lectivity;  results  using  this  procedure 
are  reported  in  [l]-[4]. 

For  larger  networks  without  symmetry,  however,  several  signifi¬ 
cant  problems  are  encountered.  Foremost  among  these  is  that,  in 
general,  the  number  of  terms  in  SP(V)  grows  exponentially  with  the 
number  of  nodes  in  V.  Specifically,  the  number  of  terms  in  SP(V) 
equals  the  number  of  independent  sets  in  V.  In  the  worst  case, 

when  all  nodes  are  independent  of  one  another,  SP(V)  includes  a  term 

N 

corresponding  to  each  of  the  2  subsets  of  V.  While  such  a  case 
corresponds  to  an  unrealistic  situation  where  no  nodes  can  communicate 
with  one  another,  many  real  networks  are  loosely  connected  and  thus 
contain  a  very  large  number  of  indepent  sets.  Indeed,  this  number 
will  grow  exponentially  with  N  if  connectivity  (average  nodal  degree) 
does  not  increase  with  N.  This  can  be  seen  by  examining  any  regular 
topology;  e.g.,  a  grid. 

Even  if  the  number  of  terms  in  SP(V)  can  be  controlled,  say  to 
grow  quadratically  with  N,  there  are  still  several  problems.  First, 
SP(A)  must  be  generated  not  only  for  A=V,  but  for  many  other  sets 
as  well.  In  particular,  each  F.j  will  have  a  different  SP(A)  in  the 
numerator.  Also,  F.^  must  be  evaluated  not  once  but  many  times 
before  the  iteration  converges.  Finally,  the  process  of  generating 
the  F.j  must  be  automated  in  order  to  avoid  tediously  having  to  man¬ 
ually  input  the  individual  expressions. 

We  now  describe  a  procedure  whch  overcomes  all  these  problems. 
We  first  observe  that  in  the  pathological  case  of  all  nodes  independent, 
SP(V)  can  be  written  as 
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N 

SP(V)  =  7t  (1+G.)  (5) 

i=l  ‘ 

If  this  expression  were  expanded  directly  into  a  sum  of  products  form 

N 

it  would  indeed  contain  2  terms  as  was  remarked  above.  As  it  is 
written,  however,  only  N  additions  and  N-1  multiplications  are  re¬ 
quired  for  its  evaluation.  While,  again,  the  situation  where  all  nodes 
are  independent  of  one  another  is  unrealistic,  the  observation  is 
nevertheless  important.  It  is  possible  to  make  the  evaluaton  of  SP(A) 
tractable  by  grouping  together  common  subexpressions  and  using  the 
distribution  of  multiplication  over  addition  to  reduce  the  number  of 
arithmetic  operations.  This  is  the  basis  for  the  procedure  described 
below . 

Every  independent  set  either  contains  a  given  node,  i,  or  it 
does  not.  If  the  set  contains  node  i  then  it  does  not  contain  any 
neighbors  of  i.  We  can  thus  evaluate  SP(A)  by 

SP(A)  =  SP(A-i)  +  G.‘SP(A-N.)  (6) 

SP((t.)  =  1 

where  i  is  any  member  of  A. 

I A 1 

This  recursive  expansion  of  SP(A)  can  generate  up  to  2' 
terms,  where  (A|  is  the  cardinality  of  A,  but  fewer  terms  will  be 
generated  in  general  since  A-N.  will  eventually  become  ()>  and  terminate 
the  recursion.  A  more  dramatic  reduction  in  the  number  of  generated 
terms  will  occur  if  we  recognize  sets.  A,  which  have  already  appeared 
and,  hence,  for  which  SP(A)  is  already  known.  In  this  case,  the 
known  value  of  SP(A)  is  used  directly  and  no  further  recursive 
expansion  is  required.  This  corresponds  exactly  to  factoring  out  a 
common  subexpression. 


As  an  example  of  how  this  procedure  works,  consider  the  net¬ 
work  shown  in  Figure  1.  We  begin  by  computing  SP(V).  The  recur¬ 
sive  expansion  of  SP(V)  is  shown  in  Figure  2.  Each  node  in  Figure  2 
corresponds  to  an  expansion  using  (6).  Thus,  for  example,  the  root 
of  the  tree  corresponds  to 

SP({12345})  =  SP({12345}-{1})  +  G. -SPC {12345}-Ni)  (7) 

where  Ni={123}.  The  G.  represent  the  same  term  appearing  in  (6). 
In  each  case  we  expanded  SP(A)  about  the  lowest  numbered  node  in 
set  A.  This  was  done  for  simplicity.  In  fact,  the  selection  of  which 
node  to  expand  about  is  significant  and  will  be  discussed  later. 
Nodes  followed  by  an  asterisk  in  Figure  2  correspond  to  terms  which 
have  already  been  generated  and  therefore  need  not  be  expanded . 
As  can  be  seen,  there  are  6  distinct  nodes  in  Figure  2. 

The  first  six  rows  of  Table  1  correspond  to  the  six  distinct 
nodes  in  Figure  2.  The  seventh  row  of  Table  1  and  the  column 
labeled  NEXT  will  be  explained  below,  as  will  be  the  procedure  for 
generating  Table  1 .  The  first  five  columns  of  the  first  six  rows  in 
Table  1  represent  the  recursive  expansion  of  SP(V)  as  shown  in 
Figure  2.  For  example.  Row  1  in  Table  1  corresponds  to  Equation 
(T)  above  where  Terms  1  and  2  are  the  appropriate  SP(A) 

and  GMULT=G.. 

1 

If  the  column  labeled  VALUE  is  used  to  hold  numerical  values  of 
SP  for  the  sets  given  in  the  column  labeled  SET,  and  GVAL  (i) 
contains  the  current  numerical  value  of  G.,  the  current  estimate  of 
the  scheduled  rate  from  node  i,  then  Table  1  can  be  used  to  obtain 
the  numerical  value  of  one  set  given  values  of  others.  Specifically, 
VALUE(5)  corresponds  to  SP((1»)  and  is  equal  to  1  by  definition.  This 
value  can  then  be  used  to  obtain  VALUE(4): 


1  erm  1 

Term  2 

GMULT 

NEXT 

VALUE 

2 

3 

1 

0 

- 

6 

4 

2 

1 

- 

4 

5 

4 

6 

- 

5 

5 

5 

3 

- 

- 

- 

0 

7 

1 

3 

4 

3 

2 

- 

5 

5 

1 

4 

Table  1  - 

Code  Table 

for  all 

SP(A) 

Figure  3  -  Precedence  graph 


VALUE(4)  =  VALUE  (TERM1(4))  +  GVAL  (GMULT(4)) 

•  VALUE  (TERM2(4)) 

=  VALUE(5)  +  GVAL(5)  *  VALUE(5) 

=  1  +  Gs 

which  is,  of  course,  SP({5}). 

It  is  necessary  that  the  rows  of  Table  1  be  evaluated  in  the 
correct  order  so  that  the  appropriate  VALUE'S  are  available  when 
they  appear  as  terms  in  larger  expressions.  While  this  ordering  is 
not  unique,  there  is  a  precedence  of  rows  imposed  by  Table  1.  This 
precedence  is  shown  in  Figure  3.  An  arc  leads  from  node  i  to  node  j 
in  Figure  3  if  row  i  must  be  evaluated  before  row  j.  This  graph  is 
obtainable  directly  from  Table  1  using  the  columns  labeled  ROW, 
TERMl,  and  TERM2.  Physically,  we  represent  Figure  3  by  the  out¬ 
ward  adjacency  of  each  node  (i.e.  a  list  of  the  endpoints  of  arcs 
leading  out  of  it)  and  the  inward  degree  of  each  node  (the  number  of 
arcs  leading  into  it).  With  this  information  one  can  do  a  topological 
sort  of  the  nodes  (c.f.  Knuth[5]),  i.e.,  we  sort  the  nodes  by  current 
inward  degree,  continually  reducing  the  inward  degree  by  eliminating 
nodes  already  sorted.  Topological  sorting  is  a  simple  procedure 
whose  running  time  is  proportional  to  the  number  of  arcs  in  the 
precedence  graph  and  which  produces  a  list  of  nodes  in  precedence 
order.  In  this  case,  the  resulting  list  would  be  5, 7, 4, 3, 6, 2,1.  Note 
that  each  row  in  Table  1  can  be  dependent  upon  at  most  two  others 
and  so  the  number  of  arcs  in  the  precedence  graph,  the  size  of  the 
adjacency  list,  and  the  run  time  of  the  topological  sort  are  all  linear 
in  the  number  of  rows  in  Table  1. 


The  column  labeled  NEXT  gives  the  next  row  to  evaluate  as 
given  by  the  topological  sort.  Row  5,  corresponding  to  $  is  of  course 
first;  this  fact  is  recorded  in  a  variable  called  FIRST.  Thus,  Table 
1  allows  us  to  evaluate  SP(V)  simply  by  making  a  single  pass  through 
Table  1  and  doing  one  addition  and  one  multiplication  per  row. 

We  have  thus  solved  the  problems  of  how  to  automatically  generate 
a  functional  expression  for  SP(V),  how  to  avoid  having  to  reevaluate 
common  subexpressions,  and  how  to  simplify  the  iterative  calculation 
of  the  G..  With  respect  to  the  last  point,  note  that  Table  1  is  set  up 
only  once.  The  evaluation  of  the  SP(A)  is  then  straightforward, 
requiring  only  K  additions  and  K  multiplications,  where  K  is  the 
number  of  rows  in  Table  1.  Indeed,  one  generation  of  Table  1  suffices 
for  the  evaluation  of  the  G.  for  many  values  of  s.^,  which  would  be 
useful  in  analyzing  variations  in  routing  and  offered  traffic. 

We  require  not  only  SP(V)  but  also  SP(V-(N.  U  Nj))  for  all  i  and 
j  which  are  neighbors  and  have  s.j  >  0.  We  represent  all  these  other 
SP(A)  in  Table  1  in  exactly  the  same  way  as  we  do  SP(V)  and  we 
take  full  advantage  of  common  subexpressions  among  all  SP(A)  and 
SP(V).  Thus,  for  example, 

SP(V-(N2  U  Na))  =  SP({5}) 

which  is  already  in  Table  1,  so  we  do  not  add  a  row  for  it.  Indeed, 
as  it  turns  out  in  this  case,  the  only  addition  to  Table  1  is  necessi¬ 
tated  by  SP(V-(N4  (J  Ng))  =  SP({1}).  This  expands  directly  in  terms 
of  SP((^)  and  hence  necessitates  the  addition  of  only  one  row.  Row  7, 
to  Table  1.  Note  that  the  topological  sort  and  evaluation  of  VALUE  is 
done  for  the  entire  table.  Thus,  for  each  iteration  on  the  G.,  only 
one  pass  through  Table  1  need  be  made  to  obtain  all  necessary  SP(A). 


The  evaluation  of  in  Equation  (3)  amounts  simply  to  dividing  the 
VALUE  in  the  row  corresponding  to  SP(V-(N.  U  f^e  VALUE  in 

the  row  corresponding  to  SP(V).  The  appropriate  row  numbers  are, 
of  course,  recorded  in  a  table  once  during  the  procedure  which 
generates  Table  1. 

The  actual  procedure  for  generating  Table  1  is  now  outlined.  A 
stack  of  sets.  A,  for  which  SP(A)  is  to  be  recursively  expanded  is 
maintained.  Initially,  the  stack  contains  V,  the  complete  node  set. 
It  is  convenient  to  maintain  such  sets  and  the  neighborhood  sets  N. 
as  bit  vectors.  At  each  stage,  the  set.  A,  at  the  top  of  the  stack  is 
popped  and  the  sets  A-i*  and  A-N.^^  are  formed,  where  i*  is  the  most 
desirable  node  in  A  to  expand  upon.  A  single  ordering  of  nodes  in  V 
is  produced  according  to  criteria  defined  below  and  is  used  to  deter¬ 
mine  i*  for  all  A.  Assume  therefore  that  the  nodes  in  the  network 
have  been  numbered  according  to  their  desirability;  i*  then  corre¬ 
sponds  to  the  leftmost  1  in  the  bit  vector  representing  A.  This  value 
should  be  recorded  along  with  the  set  A  so  that  it  can  be  used  as  the 
starting  point  for  the  search  for  the  best  node  in  the  sets  A-i*  and 
A-Nj*.  This  value  in  fact  already  appears  in  the  GMULT  entry  in 
each  row. 

Initially,  Table  1  contains  only  one  row,  corresponding  to  V. 
The  SET  entry  in  row  1  contains  a  vector  of  N  I's.  As  a  set.  A,  is 
popped  from  the  top  of  the  stack,  its  TERMl  and  TERM2  entries  are 
set.  These  should  correspond  to  the  rows  in  Table  1  whose  SET 
entries  contain  A-i*  and  A-N.^,..  A  search  of  the  SET  entries  cur¬ 
rently  in  Table  1  is  made  to  determine  if  these  sets  are  already 
present.  If  so,  the  row  containing  each  set  is  recorded  in  the  TERMl 
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and  TERM2  entries  of  the  current  row.  If  not,  a  row  is  added  to 
Table  1  for  each  new  set,  the  SET  and  GMULT  entries  are  filled  in, 
and  the  new  sets  are  added  to  the  stack  of  sets  to  be  evaluated.  In 
practice,  the  row  number  in  Table  1  rather  than  the  set  itself  is 
placed  in  the  stack  in  order  to  avoid  having  to  search  for  it.  Also  a 
binary  search  tree  or  AVL  tree  [6]  structure  on  the  SET  values 
should  be  kept  to  facilitate  the  search  for  existing  sets.  An  alter¬ 
native  to  this  is  to  hash  these  entries  as  described  in  [6]. 

To  generate  entries  corresponding  to  SP(V-(N.  U  Nj)) ,  these 
sets  are  treated  exactly  like  those  arising  above;  i.e.  a  search  is 
made  for  them  and  if  necessary  they  are  added  to  Table  1  and  to  the 
stack.  The  search  for  i*  in  such  sets  starts  at  node  1. 

We  thus  see  that  both  the  code  generation  procedure  and  evalua¬ 
tion  procedures  are  proportional  to  the  number  of  rows  in  Table  1. 
There  are  other  factors  such  as  the  search  time  for  existing  sets,  the 
search  for  i*  in  each  row,  and  the  generation  of  sets  V-{N. 

These  latter  factors  are  minor,  at  worst  polynomial  in  the  number  of 
nodes.  Our  primary  concern  is  then  with  the  number  of  rows  in 
Table  1 ,  which  can  theoretically  grow  exponentially  with  N ,  the 
number  of  nodes. 

The  number  of  rows  in  Table  1  will  be  kept  down  if  sets  arise 
repeatedly  in  the  expansion  process.  This  is  more  likely  to  happen  if 
a  consistent  oraering  of  the  nodes  is  used  in  expanding  all  sets. 
This  is  in  fact  done,  as  was  mentioned  above.  If  network  connec¬ 
tivity  is  related  to  distance,  a  node  ordering  based  on  location  will 
result  in  similar  subsets  being  generated  during  the  expansion  process. 
More  generally,  any  ordering  based  on  connectivity  is  likely  to  help. 


If  C  is  a  cut,  dividing  V  into  disjoint  sets  A  and  B  with  no  links 

between  them,  as  in  Figure  4,  then  Equation  (8)  becomes 

SP(V)  =  1  [(  n  G.)  SP(^-N-)  SP(B-Nj)]  (9) 

ICC  i£l  ‘  ^  '■ 

Thus,  the  sum  of  products  "factors”.  The  numbering  scheme  will 
ensure  that  the  nodes  in  B-N^  are  left  for  last  and  hence  become  a 
common  subexpression  for  all  subsets  of  SP(A-Nj).  Thus,  the  factors 
in  (9)  will  be  evaluated  separately  and  multiplied  together  at  the  end. 
This  technique  can  be  applied  recursively  to  decompose  very  large 
networks.  It  appears  to  have  the  property  that  Table  1  will  grow  at 
worst  exponentially  in  the  size  of  the  cuts. 

We  have  coded  a  preliminary  version  of  this  procedure  in  order 
to  perform  experiments  to  determine  the  growth  rate  of  Table  1  as  a 
function  of  N  using  various  types  of  connectivity  and  node  orderings. 
Thus  far,  the  following  conclusions  and  observations  have  been  made: 

1.  Random  X  and  Y  coordinates  were  generated  for  N  nodes  in 
a  unit  square.  Euclidean  distances  were  computed.  A 
threshold,  T,  was  varied  from  0  to  2.  Nodes  at  distance  T 
or  less  were  said  to  be  connected.  Experiments  for  N=15 
and  N=30  were  run.  Nodes  were  ordered  based  on  the  sum 
of  their  X  and  Y  coordinates.  For  N=15,  Table  1  typically 
had  about  50  rows,  for  SP(V)  only.  For  N=30,  Table  1 
typically  had  about  120  rows.  The  conclusion  we  draw  on 
the  basis  of  this  limited  data  is  that  Table  1  does  not 
appear  to  be  growing  exponentially.  Indeed,  it  seems  to  be 
growing  only  slightly  faster  than  linearly. 


2.  For  random  connectivity  and  node  ordering  based  on  nodal 
degree  (largest  first)  roughly  80  terms  were  generated  for 
N=15  and  roughly  600  terms  for  N=30.  These  results, 
based  on  few  runs,  are  inconclusive. 

3.  Grid  networks  (nodes  evenly  spaced  over  a  unit  square)  of 
25  nodes  and  varying  connectivity  were  examined  using 
distance  based  connectivity  as  in  1  above.  Table  size 
varied  with  connectivity,  with  the  worst  case  being  130 
terms  at  a  nodal  degree  around  6.  Again,  the  conclusion  is 
that  table  size  does  not  seem  to  be  growing  very  rapidly. 

We  are  currently  implementing  a  more  complete  version  of  the 
procedure  in  order  to  do  more  extensive  testing. 
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I.  INTRODUCTION 


Packet  radio  has  recently  emerged  as  a  viable  technology  for  both  fixed  and  mobile 
computer  communications.  It  utilizes  packet-switched  communications,  which  became 
highly  important  in  computer  communication  networks,  over  broadcast  radio  channels. 

The  original  packet  switching  concept  was  based  on  point-to-point  data  circuits 
interconnecting  network  packet  switches.  With  technology  advances  in  satellite  and  ground 
radio  broadcasting,  packet  radio  networks  have  become  an  area  of  major  interest  [6]  . 
During  the  eariy  1970's,  the  ALOHA  project  at  the  University  of  Hawaii  demonstrated  the 
feasibility  of  using  packet  broadcasting  for  bursty  computer  traffic  [1]  . 

In  a  broadcast  radio  network,  channels  are  shared  by  a  set  of  nodes  and  cannot  be 
dedicated  to  specific  pairs.  Numerous  studies  ( [l]  ,  [6]  ,  [7]  ,  [5]  ,  and  [9]  )  have  shown 
that  "fixed  capacity  assignments"  '.re  wasteful  for  many  applications  compared  to  "random 
access  schemes"  which  result  in  a  dynamic  sharing  of  channel  capacity  without  centralized 
control.  These  studies  focused  on  a  single-hop  network,  whereby  Packet  Radio  Units  (PRU's) 
attempt  to  transmit  data  to  a  single  central  station.  In  conjunction  with  the  fact  that  there 
is  no  control  over  access  to  the  channel,  destructive  errors  result  when  several  packets  are 
received  simultaneously  or  "collide."  These  collisions  reduce  the  channel  throughput  under 
random  access.  ^ 

The  analysis  of  multihop  packet  radio  networks  involving  packet  routing  via  relays,  is 
extremely  difficult.  So  one  must  use  simulation  CS] ,  flO]  or  study  simple  models  to 
measure  network  performance,  mainly  the  network  capacity  or  network  throughput  [8j , 
[14],  and  [15]  .  The  throughput  of  multihop  slotted  ALOHA  network  was  studied  recently, 
[12] ,  and  it  was  found  that  one  of  the  most  important  factors  affecting  it,  is  the 
transmission  radius  (or  transmission  power)  used  by  the  nodes.  To  increase  the  throughput, 
one  may  even  want  to  reduce  the  transmission  power  to  a  level  lower  than  the  maximum 
possible. 

Techniques  which  are  used  to  increase  the  network  throughput  include  Carrier  Sensing 
Multiple  Access  (CSMA)  [13] ,  and  resolution  of  collisions  via  packet  capture  mechanisms. 
The  effect  of  capture  in  increasing  the  network  throughput  was  studied  in  [2]  and  [11]  for 
single-hop  networks.  In  [4]  the  effect  of  capture  on  the  throughput  was  investigated  for 
multiple  transmitters  and  receivers  in  a  slotted  ALOHA  environment. 

The  problem  of  obtaining  exact  expressions  for  throughput  and  blocking  probabilities  in 
multihop  packet  radio  networks  using  CSMv\  access  method  was  considered  in  [3] .  Exact 
results  were  obtained  assuming  exponential  packet  lengths  for  general  topologies.  This 
analysis  has  been  extended  to  constant  packet  length  for  selected  simple  topologies. 


In  this  work,  we  consider  multihop  packet  radio  networks  using  slotted  end  unslotted 
ALOHA  access  methods  with  and  without  capture.  We  consider  a  set  of  identical  PRU's 
randomly  (uniformly)  distributed  in  the  plane.  We  find,  via  analytic  techniques  and 
computer  experimentation,  that  the  optimal  density  of  radio  units  which  maximizes  the 
throughput  of  the  network  is  the  same  for  all  access  methods  considered  and  corresponds  to 
the  previous  found  value  for  the  slotted  ALOHA  case  without  capture  [12]  . 


2.  SLOTTED  CASE  MODEL 


The  nodes  of  the  network  are  assumed  to  be  uniformly  distributed  with  density  X. 

Two  access  methods  are  considered: 

•  No  capture  (slotted  ALOHA). 

•  Capture  (closest  neighbor). 

We  consider  a  multihop  packet  radio  network,  with  transceivers  uniformly  distributed  in  a 
large  geographical  area.  Packet  Radio  Units  (PRU's)  have  equal  transmission  range.  Any 
node  within  the  circle  of  radius  r  around  a  transmitting  node  will  hear  the  transmission  and 
can  respond  to  it.  Each  node  always  has  a  message  to  send  and  will  do  so  whenever 
permitted,  (see  Figure  1). 

Let: 

A|  *  Range  of  reception  for  terminal  i. 

N|  *•  Number  of  terminals  in  the  area  A| 

G  *  Probability  of  a  packet  transmission  during  a  slot  by  a  terminal. 

q|  -  Probability  of  successful  reception  of  a  packet  from  a  transmitting 
terminal  of  the  area  A-. 

qj  *  Pr  I  a  neighbor  transmits  successfully  to  i/A. }  *  Pj  •  1*2  *  ^3» 
where  P^  -  probability  that  i  does  not  transmit. 

Pj  *  1  -  G 

^  - 


Probability  that  a  neighbor  from  area  Aj  transmits  to  i  and  i  can  receive 

the  packet. 


N. 


G  (1  -  G)  -  no  capture  (one  transmission) 


pj  *  1  -  (1  -  G)  -  capture  (one  or  more  attempts  made) 

P«  -  Probability  that  captured  packet  is  destined  to  i,  while  source  has  N. 
options  to  address. 


Therefore, 


a.  No  capture  (slotted  ALOHA): 
I 

qj  s  G(1  -  G) 


«i 


b.  Capture: 

0|-  V 


N, 


Assuming  that  the  number  of  terminals  in  an  area  A  is  a  Poisson  random  variable  with 
density  X,  we  have  that  S  (the  throughput  to  a  node)  is  given  by: 

-  (XA)^  e'^^ 

S  *  £  ~  ■"  ■ 

i*l  i! 

XA  a  N  -  Average  degree. 

Therefore, 


a.  No  capture: 


vj 


•:» 


2.3 


SaGe'^^  -Ge'^ 


t>.  Capture: 


S  a  (1  -  G)  e'**  E  -L 


E  J_  ri.{i-G)n 

i=l  i!  i  L  J 


The  maximum  throughput,  will  be  achieved  with  G  =  G^^>  by  setting: 

- 

de*o 

The  total  network  throughput  will  be: 

Where  n  is  the  total  number  of  nodes.  The  average  number  of  hops  in  an  end-to>end  path  h 
is  given  in  [12]  : 


(«■)* 


w  i  -N  rcos’^(t)-t>/r?i 
1  +  e  ^  /  e  ?  L  J  dt 


Therefore,  the  end-to-end  throughput  Y  is: 


To  find  G^^: 


a.  No  capture:  it  is  given  in  [12]  . 


b.  Capture: 


Sa(i-G)e‘^  £  -  fl-d-G)*]  ; 

iai  i!  i  ^  J 


2.4 


■I4"f 


—  1'^%  ’ 


nac 


I 


dS  . 

as  - « 


—  ri-(i-G)‘i 

i  i!  i  L  J 


(1  -G)e‘^  £  -  i  (1  -  G)’  =  0 

i  i!  I 


jNa-G)_j+  £  JL  [i-d-G)*] 

i  i!  i  L  J 


Therefore, 


» t 


•J 


«f  j 


2.5 


3.  OWSLOTTED  CASE  MODEL 

For  the  unslotted  case,  the  following  schemes  are  considered: 


Pure  ALOHA  (All  PRU's  transmit  at  random.  Collided  receptions  result  into  loss 
of  all  colliding  packets). 


-  Perfect  Capture  (In  case  of  collision,  the  receiver  keeps  tracking  the  packet  it 
received  first.  This  is  possible  under  a  broad  range  of  spread  spectrum 
technologies). 

•  Half-Amplitude  Capture  (In  ease  of  collision,  the  receiver  will  listen  to  the 
closest  transmitter). 

As  in  the  unslotted  ease,  we  assume  that  PRU's  are  randomly  located  in  the  network  area. 
The  rate  of  new  packets  and  total  rate  of  packets  (including  retransmissions)  between  PRU's 
separated  by  a  distance  d,  are  given  by  and  respectively. 

The  following  areas  are  defined  (see  Figure  2): 


% 


Aj^  -  Inside  the  contour  D-E-F-C-H-D. 

Aj  *  Inside  the  contour  O-K-C-F-G-E-0. 

Aj  -  biside  the  contour  D-J-F-G-E. 

A^  -  Inside  the  contour  E-G-F-A-E. 

Aj  -  Inside  the  contour  0-E-A-F-C-H-D. 

These  areas  can  be  expressed  as  functions  of  communication  radius  (r)  and  variable  angle  o 

Aj  *  sr^ 

2  r* 

Aj  •  nr^  -  ~  (2  -  sin  2  0^)  •  2 


3.1 


nac 


0,  if  d  <  r/2  or  9^  >  cos"^  ^ 

2 

2~  (2  Q3  -  sin  2  93)  -  Y-t2  0  2  "  sin  2  82)  = 


i -  +  5  sin  [2  CQs"^  /Li - \  ]  )  ; 

4  0080^  ^  L  \4coa0jyJ) 


Since, 


d  *  2r  cos  9j^; 


cos“^  ;  snd0j  «  2  03; 


ffd^  »  4  irr^cos^  0j^  ,if9j>eos“^  | 


^  * 


ird^-Aj  =  irr^ 


rtr^  -  A, 


irr^  ' 4  cos^  0jj  ;  if  0j^  >  cos’^  ^ 


Therefore,  the  joined  probability  of  the  number  of  nodes  in  the  areas  above  is: 


P(nj,  02)  = 


n-  n, 
(AA^)  ^  (AA^)  ^ 


"2- 


^-X(Aj  *  A2) 


P(n^,  nj) 


'S  "3 

(XAj)^  (AAj)  ^ 

"r  "3- 


^-A(Aj  ♦  Aj) 


3.3 


Wnj,  n^,  nj) 


3 


rij!  n^f  nj' 


n.  n. 

(XA  )  *  (XA.)  ...  .  .  . 

POj^.nj)  =  - 1 - 5 - 


V  "s’ 


:th 


Where  =  population  of  i  ‘  area. 

Using  the  average  degree  definition: 


N  a  Xsr" 


we  obtain: 


P(nj,  Hj)  = 


N<"l  *  "j> 


Hj!  aj! 


exp 


{>(-^  -  =^)j' 


n. 


P(n^y  n^)  ■“ 


N^"l  ■*■  "3^  /  Aj  ' 


nji  ir 


"3  \r2 


exp 


n^,  nj)  = 


N^"3  *  "4  "  "5) 


I  /  "3 


nj!  n^!  nj! 


n  V  ^2^ 


j4  cos^  9^ 


•fl-4cos2  0  ♦  i-  -i.'\  .exp(-Nfl  +  - 


P(n^,  Hj) 


N 


(n^  ♦  nj) 


"s' 


4  cos  0 


1-4  cos"  3 


-N 

ij  •'  ' 


In  order  to  compute  the  throughput  (successful  transmission  rate  S^)  between  PRU  1  to 
PRU  2  separated  by  a  distance  d,  the  following  observation  is  made:  For  PRU  2  to  receive 
correctly  a  packet  (of  duration  normalized  to  1),  the  packet  should  be  transmitted  from: 

-  Area  (Aj^  UAj)  during  |t  -  1,  t  +  l|  for  pure  ALOHA,  (t  is  the  time  of  packet 
transmission). 

-  Area  (Aj^  UAj)  during  |t  -  1,  t|  for  perfect  capture. 

•*  Area  (A^  UA^  ^^5^  during  |t  -  1,  t  >  for  half  amplitude  capture. 

The  probability  of  successful  transmission  will  be  given  as  follows: 


Pure  ALOHA: 


E 

> ; 

nj  >  0 


j  -2G  (n^  +  nj) 


n^  >  2 


~  e 
"1 


P(nj^  n2) 


Perfect  Capture; 


S. 


El  n/  \ 

X  •  1  2  Knj.nj) 

>  2  1 


"2 


Half  Amplitude  Capture; 


S. 


—  -  E 


.  -2G  (n.  >  n-) 

-  «  P(n3,  n^,  ng); 


i/  Aj  exists. 


L 

"4  >  2 

"5>0 


"4  ^"5 


-2G  n. 

*  P(n^,  Aj); 


if  A3  *  0. 


Using  the  expression  for  probabilities,  transformation  d  — » Q  (see  below),  and 
uneonditioning  with  respect  to  e 


Pure  Aloha: 


S* 


ttj  >  2 


(n,  +n,)  ff/2 
-2G(n,+n<,)  N  ^  ^  /* 

Ge  ^  J  2  sine-  |  1 


2e^  sin  20J 


njjO 


exp 


W2ei  sin  2  e-  \  X 


Perfect  Caoture: 


S  = 


S  C 


-Gfn^+nj) 


(n^+nj)  x/2 


n^^  >  2 


N**  r  /20,  3in20,\ 

.  7  2  sine,  1-—  ^  - ~] 


n^>0 


.  exp  j  N  f- 


2  ♦ 


20J  sin  2  0  ^ 


1  !  '<9li 


Half-Amplitude  Capture; 


S 


E 

n .  >  2 
Vi  i  ° 


Ge 


-2G(n3+n^)  N 


"3^5- 


cos’^  J  =  1.32 


s  1.043 


1  -  4  005*0, 


exp 


L  G« 


-2Gn^  j^^"4*"5^ 


x/2=1.57l 


"4  >2 

n5>0 


"4-  "s'  ^"4‘^"5^  cos"^  ;  =  1.32 

4 


y*  2sin0^  ^4co8^0j^^ 


(l-4cos^0j)  e”^  d0^; 


To  find  Ggp^,  we  set; 


This  equation  is  solved  numerically  using  Newton's  method: 


qW+I)  _  qO)  . 


r  (G^‘^) 


for  given  error  value  allowed. 

Using  R ,  [12] ,  we  obtain  the  throughput: 


;  where  s  n  •  S 


Tranformation  d  —*9. 


Since  d  is  uniformly  distributed  (see  Figure  3)  its  distribution  will  be: 


fe  (Sj)  * 


(2r  cos0j) 


2rVl  - 


-  2  sin  0 


2rvi-- 


4r^cos^  0, 


for  y  <  0^  <  Y  ; 


4.  NUMERICAL  RESULTS 


The  analyses  described  above  were  implemented  in  computer  programs  in  order  to 
compute  the  maximum  throughput  for  various  average  degree  values.  These  programs  for 
the  unslotted  cases  involve  considerable  computational  complexities  and  excessive  run 
times.  We  have  been,  however,  able  to  obtain  the  basic  results  which  are  portrayed  in 
Figure  4. 

Y 

In  Figure  4  we  show  the  normalized  throughput  —  as  function  of  the  average  degree 

Vn 

N.  For  slotted  capture  ease,  the  value  of  N,  which  maximizes  the  throughput,  is  9  or  10,  at 
which  point  the  optimal  network  throughput  is  0.19  y/n  .  The  maximizing  value  of  N  for  the 
four  other  eases  is  approximately  6,  which  is  the  same  result  as  in  [12] . 

Note  that  the  significance  of  the  optimal  degree  can  be  interpreted  as  what  should  be 
the  density  of  PRU's,  in  a  deployment  scenario  in  order  to  achieve  maximum  throughput. 


5.  FURTHER  RESEARCH 


Th«  approximate  nature  of  the  models  studied  was  Justified  from  the  need  to  obtain  a 
tractable  solution  to  the  optimal  density  of  radio  units  in  a  deployment  scenario.  The 
following  extensions  can  be  performed  using  similar  assumptions  on  the  random  topology  and 
disregarding  boundary  effects: 

I 

•  Model  Carrier  Sensing  Multiple  Access  (CSMA)  with  and  without  captiire.  Ho 
exact  models  for  CSMA  have  been  so  far  obtained,  due  to  the  non-Marcovian 
nature  of  the  resulting  packet  transmission  process  (see  [3]).  Approximate 
methods  can  be  devised  or  additional  assumptions  on  exponential  packet  lengths 
can  force  a  Markovian  birth-death  approach  as  in  [3] . 

-  Model  flood  routing,  whereby  every  PRU  relays  all  received  packets,  regardless 
of  their  ultimate  destination. 

Further  research  should  address  the  impact  of  specific  topology  and  routing  instead  of 
the  random  dispersed  PRU*s  over  an  infinite  area.  More  specifically,  in  [3] ,  Boorstyn  and 
Kershenbaum  proposed  a  methodology  for  modeling  CSMA  packet  radio  networks  with  or 
without  capture,  under  the  assumption  of  exponential  packet  length.  The  generalization  of 
the  method  to  Hxed  or  general  packet  length  distribution  is  still  an  open  issue. 

Other  areas  of  interest  are: 

1.  Flow  control  techniques  to  achieve  maximum  throughput. 


2.  Feasibility  of  using  packet  radio  networks  for  packetized  voice. 
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A  Technique  for  Adaptive  Routing  in  Networks 

R0HI;RT  R  BOORSTYN.  mimhik.  III).  VM)  ADAM  LIVNE.  kef. 

(tnvUfd  Paper) 


Abstract — A  iMo-Irtrl  adipllvr  rnulinK  srhrmi'  Fdf  piirkri-^wili'hrd 
rotnpuirr  loinmunirvliiin  nrlworks  is  pr<ip<isrd  and  intcsIiKaIrd.  Tht 
Tirsl  levrl  is  quasi-slatio  and  hasrd  on  Ihr  (Inhal  nrloork  slalus.  Thr 
srrnnd  K'vrl  is  dynamic  with  decisions  brine  made  at  rack  nodr  in  an 
alirmpi  In  obtain  Ihr  savines  in  averapr  delay  prrdirird  by  a 
mullisrrvcr  modri  of  Ihr  nodr.  .Simulations  conrirm  Ihr  prrdirird 
improvrmrnt. 

1.  INTRODUCTION 

OST  adaptive  routing  .<K;hemes  perform  about  as  well  as 
nonadaptive  schemes  when  evaluated  in  a  fixed  enviion- 
ment  (!].  However,  they  do  adapt  to  changes  in  network 
topology  and  input  statistics.  The  reason  they  do  not  show  a 
significant  improvement  is  that  they  are  actually  quasi-static 
in  that  they  sense  and  respond  to  the  above  changes  slowly. 
Their  goal  is  to  select  good  paths  [2] . 

We  focus  our  attention  on  the  node,  which  may  be  viewed 
as  a  multiple  server  queueing  system.  Most  adaptive  routing 
schemes  operate  the  node  as  a  collection  of  single  server 
queues.  If  the  node  is  operated  as  a  multiple  server  queue,  an 
advantage  at  the  node  of  a  factor  approximately  equal  to  the 
number  of  servers  (outgoing  branches)  can  be  obuined.  How¬ 
ever.  the  control  over  good  paths  may  be  lost.  We  show  that 
we  can  simultaneously  obtain  both  good  paths  and  improved 
node  performance.  These  improvements  in  time  delay  exist 
for  the  network  as  well.  Consequently,  network  bandwidih 
can  be  significantly  reduced.  Furthermore,  the  approach  lends 
Itself  to  analysis.  We  also  discuss  limits  and  extensions. 

Many  computer  communication  networks  (ARPANET. 
TELENET.  TY.MNET.  etc.)  use  dynamic  routing  schemes  to 
compensate  for  input  traffic  variations,  to  respond  to  changes 
in  topology,  and  to  take  advantage  of  temporary  changes  in 
loading  in  different  paths.  During  the  design  of  a  network, 
they  are  replaced  by  analytically  tractaole  nondynamic  (static) 
schemes.  We  present  here  a  routing  scheme  for  which  we  have 
been  able  to  denve  .ipproximate  analytical  models.  Further¬ 
more.  we  c.in  establish  the  efficiency  of  this  scheme,  especially 
in  heavily  loaded  situations. 

A  typical  static  routing  scheme  would  operate  as  follows. 
Copsidci  as  separate  commodities  the  message  originating  at  a 
particular  node  and  destined  for  a  second  node  in  the  network 
The  static  routing  scheme  would  specify  the  optimum  propor- 
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tion  of  traffic  to  be  routed  over  each  path.  Efficient  algorithms 
exist  for  design  of  tliis  type  of  routing  (3| . 

We  can  identify  one  particular  problem  with  this  approach. 
Although  good  paths  are  indeed  found,  any  node  essentially 
operates  as  a  collection  of  single  server  queues-one  queue  for 
each  outgoing  branch.  Considering  the  node  as  a  queue  with 
several  potential  servers,  this  is  not  an  efficient  manner  of  op¬ 
eration.  Indeed,  if  a  node  had  k  outgoing  branches  and  were 
operated  as  a  queue  with  k  servers,  then  the  time  delay  would 
be  reduced  by  a  factor  of  approximately  k.  Conversely,  the 
throughput  can  be  increased. 

If  the  node  were  operated  as  suggested  above,  messages 
would  wander  aimlessly  through  the  network  and  the  total 
performance  would  be  abysmal.  Our  approach  is  to  retain  the 
good  paths  for  commodities  and  yet  still  get  the  benefit  of  the 
faster  performance  at  the  node. 

Briefiy  our  scheme  is  as  follows  [4] .  Consider  a  node  as  a 
single  queue  with  several  servers  (output  channels).  For  a 
particular  commodity,  i.e.,  a  message  with  a  certain  destina¬ 
tion,  the  use  of  some  of  these  servers  would  cause  the  mes 
sages  to  be  sent  along  “bad"  paths-either  loo  long  or  too 
congested.  Thus,  for  each  commodity  and  at  each  node  we 
specify  a  subset  of  the  output  channels  as  allowable  and 
permit  the  message  to  use  any  allowable  channel  according  to 
some  discipline.  Each  commodity  appearing  at  the  node  has 
its  own  allowable  set  of  channels.  There  restrictions  force 
messages  to  use  “good”  pathis.  It  has  already  been  noted  [5] 
that  an  advantage  can  be  obtained  if  a  selection  between 
relatively  good  paths  is  not  forced,  and  the  adaptive  routing 
scheme  continues  to  allow’  each  of  these  to  be  used. 

The  assignment  of  allowable  branches  at  each  node  for  eacti 
commodity  is  one  level  of  our  adaptive  routing  scheme.  These 
assign-ments  are  based  on  essentially  global  information  of 
topology,  flows,  and  long  term  averages,  and  may  be  adaptive 
in  a  quasi-static  way.  The  second,  and  truly  dynamic,  level  of 
our  routing  strategy  is  local  and  involves  the  queue  discipline 
at  each.  node.  As  an  example,  consider  a  node  with  two  out¬ 
going  channels  (servers).  All  message  commodities  fall  into 
three  classes.  Two  of  these  must  use  only  one  of  the  servers 
and  h.ive  no  choice.  Messages  join  the  appropriate  queue  ant! 
arc  served  in  turn  Tlie  third  class  of  messages  may  use  either 
of  the  two  servers  and  join  a  third  queue. 

Another  strategs  at  a  node  is  to  give  priority  to  the  mes¬ 
sages  Uiat  have  no  choice,  i.e..  are  dedicated  lo  one  of  the 
servers.  Yet  another  strategy  is  lo  allow  messag-.'s  in  the  tlm.' 
category  (noiidedicated)  lo  join  the  shoiier  ol  the  iwo  dedi¬ 
cated  queues  We  have  evaluated  the  peiloimance  of  bo'ii 
stiateeica.  and  oilier  siinilai  sti.iie;.'ies  li'i  ihis  si.nple  two 
scrvei  iiioiel.  lo;  'iioie  ».oiii()le\  iiooe  models,  and  for  simp!.' 
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networks.  The  measure  of  performance  was  the  average  time 
delay  for  all  messages  traversing  the  node. 

II.  NETWORK  MODEL 

We  consider  a  network,  actually  the  backbone  of  a  distrib¬ 
uted  (store-and-forward)  packet-switched  data  communica¬ 
tions  network,  as  a  collection  of  nodes  connected  by  full 
duplex  links—comprising  the  topology.  The  input  traffic  is 
described  by  a  matrix  of  rates  (packets/second)  of  traffic 
originating  at  one  node  and  destined  for  another.  Index  the 
nodes  by  1,  N  and  denote  the  links  between  nodes  i  and/ 
by  (i. /’)•  Alternately,  index  the  links  by  1.  ••,  /,.  fWe  assume 
N  nodes  and  L  links.)  Denote  the  input  traffic  (from  source  i 
to  destination  /)  by  7,,  (packets/second).  Assume  (for  simplic¬ 
ity)  that  the  average  length  of  all  packets  is  l/n  bits.  Assume 
further  that  all  input  streams,  for  each  of  tlte  —  1)  node 
pairs,  are  independent  and  Poisson.  Let  the  link  capacities  be 
given  by  C/. 

A  nonadaptive  routing  is  given  by  a  set  of  paths  for  each 
commodity  (node-pair  traffic)  and  the  proportion  of  input 
traffic  using  each  path.  From  this,  the  flow  in  each  link  in 
each  direction,  X(ij-)  =  X(  (packets/second),  can  be  found. 
Note  that  7  =  £/,/7r,/  is  the  network  throughput,  or  total 
offered  load,  and  X  »  Zf,/X//,/)  is  the  total  internal  traffic. 
Furthermore,  note  that  /  =  X/7  >  1  is  the  average  path  length 
in  the  network. 

We  assume  exponential  packet  lengths.  We  further  make 
the  “independence”  assumption  [6]  that  at  each  node  in  a 
path,  a  new  random  assignment  of  the  packet  length,  with  the 
same  distribution,  is  made.  This  idealization  leads  to  the  prop¬ 
erty  that  each  node  can  be  modeled  independently  as  simple 
Af/Af/1  or  MIMIk  queues.  It  has  been  shown  that  this  approxi¬ 
mation  yields  valid  results  for  many  network  situations  (6) . 
Some  care  must  be  taken  in  situations,  such  as  adaptive 
routing,  where  dependence  between  nodes  is  a  key  factor.  We 
will  ignore  this  warning  until  the  end,  when  we  comment  upon 
its  implications,  and  assume  the  above  type  of  independence  at 
each  node. 

Many  streams  of  traffic  converge  at  a  node-from  incoming 
network  links  and  from  outside  the  network.  Some  of  this 
traffic  is  destined  for  this  node,  leaves  the  network,  and  will 
not  concern  us  at  this  node.  The  remaining  traffic  must  be 
routed  to  an  appropriate  outgoing  link.  A  nonadaptive  strategy 
decides  what  proportion  of  such  traffic  uses  each  link  on  a 
commodity-by-commodity  basis.  Thus,  each  link  can  be 
modeled  as  an  Af/Af/1  queue  and  has  link  average  time  delay 
Ti  =  l/(/iC|  —  X/).  The  average  time  delay  in  the  network  is 
T  =  (Z/X/T/l/y.  If  we  define  Tiini,  =  (2,X/r/)/X  as  the  time 
delay  of  a  typical  link,  then  T  = 

III.  NODE  MODEL 

A  node  in  a  network  has  several  outgoing  links,  say  k\  it 
is  said  to  be  of  degree  k.  In  nonadaptive  routing,  the  node 
operates  as  a  collection  of  single  server  queues  which  we 
can  model  as  M/MI\  queues.  Thus,  the  time  delay  for  link 
I  is  Ti  =  llifiCi  —  X/).  The  time  delay  for  node  n  is  T(„)  - 
2/^»i,/7'(«,/)/X(rt)  where  X(„)  =  2,X„.r  is  the  rate  of  traffic 


that  node  n  sends  onto  the  network.  Tlie  network  time  delay 
is  T  =  I„X(„)7’(„y/7  v.'hicl:  gives  the  same  result  as  above,  but 
expressed  in  terms  ol'  nodal  delay. 

Ignoring  routing  tor  a  nior.ctit.  the  node  can  be  viewed  as  a 
multiple  server  queue  (with  k  servers).  If  we  assume  all  C,  =  C 
at  this  node,  and  if  alt  packets  join  one  large  queue  and  are 
served  by  any  available  server,  then  we  have  an  M/M/k  queue 
with  average  waiting  time  at  node  n  given  by 


IV  --Lfs-L 
''"’"vc  1 1-; 


(I) 


where  p  =  \(„yliikC  and  Pg  is  the  probability  that  all  servers 
are  busy.  Although  Pg  can  be  enumerated  given  p  and  k,  the 
only  property  we  need  here  is  that  ffl  -►  1  as  p  -►  1 . 

Contrast  the  above  result  to  that  for  a  simple  server  queue 
where,  with  p  =  X//pC,  the  waiting  time  at  link  /  is 


W,= 


pC 


p 

1  -p‘ 


(2) 


We  see  that  the  multiple  server  operation  has  a  waiting  time 
advantage  of  at  least  k.  The  time  delay  is  the  waiting  time  plus 
the  service  time  (1/pC).  As  p  -►  1 ,  the  waiting  time  dominates, 

-V  1 ,  and  the  multiple  server  operation  has  an  advantage  in 
time  delay  of  a  factor  of  k.  Although  we  concentrate  in  this 
paper  on  packet  time  delay  as  a  performance  measure,  many 
other  network  situations  focus  on  waiting  time.  In  that  case, 
greater  reductions  in  delay  may  be  achieved  since  Pg  is  of  the 
order  of  p* . 

In  making  the  above  comparison,  we  assume  that  the  'ink 
and  node  utilizations  are  not  increased  when  adaptive  r'lu'i  g 
is  used  to  obtain  multiple  server  performance  at  the  no^^' ..  As¬ 
sume  first  that  only  minimum  hop  paths  are  used  for  the  non- 
adaptive  routing,  resulting  in  single  server  queues  at  the  links. 
Allow  the  adaptive  routing  to  choose  between  all  minimum 
hop  paths  for  each  commodity.  This  will  produce  the  same 
utilization  in  both  cases.  If  other  paths  are  used  in  the  non¬ 
adaptive  routing,  similar,  but  less  precise,  arguments  can  be 
made.  It  will  be  seen  in  the  next  sections  that  the  adaptive 
routing  performs  better  when  more  of  the  traffic  has  choice. 
Since  there  is  a  significant  improvement  in  time  delay,  when 
the  utilization  is  moderate  it  is  possible  to  trade  off  a  higlier 
utilization  by  using  longer  paths  with  the  resulting  greater 
choice.  In  the  examples  discussed  below,  we  have  not  found 
this  to  be  necessary.  In  some  cases  where  minimum  hop  paths 
are  not  used  in  the  optimum  nonadaptive  routing,  the  adaptive 
routing  could  still  provide  a  choice  of  similar  length  paths. 
Then  the  utilization  will  not  be  increased. 

These  observations  are  summarized  in  Figs.  1  and  2.  In  Fig. 
1  we  compare  waiting  times  for  queues  with  two  servers  and  a 
single  stream  of  input  traffic.  Multiple  server  operation  is  best. 
In  the  other  three  cases,  a  queue  is  provided  for  each  server 
and  the  input  stream  is  divided  into  '.wo  in  different  ways  In 
order  of  decreasing  performance,  an  arnval  joins  the  shorter 
queue,  arrivals  alternate  between  the  two  queues,  and  arnvals 
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nadondy  sdeet  a  queua.  Ilia  lattar  li  aquhakiit  to  two  tin^e 
aemr  qucuai.  In  Fig.  2  wa  contmt  moltipla  aarmandtinile 
term  perfonmiioa  in  a  node. 

Our  adaptive  routing  procedure  will  attempt  to  obtain  these 
performanoe  advantages. 

IV.  ADAPTIVE  ROUTING  PROCEDURE 

Pint  consMa  routing  all  packets  at  a  node  by  allowing 
them  to  use  any  outgoing  link.  The  node  can  new  be  modeled 
as  a  multiple  server  queue  and  achieves  the  already  described 
performance  advantage  over  single  server  operation.  Path 
lengths,  however,  would  becotiw  catremely  long  because  of 
this  random  routing,  the  total  traffic  in  the  network  would  in- 
create  catastrophically,  and  total  network  performance  would 
be  destroyed.  We  wiU  show  how  to  achim  the  time  delay 
advantage  without  this  deterioration  in  total  network  perform¬ 
ance. 

We  propose  a  two-level  adaptive  routing  procedure.  The 
first  level  it  global,  and  while  still  adaptive,  is  expected  to  be 
slowly  varying,  responding  to  average  statistics  of  congestion 
and  traffic  and  alarms  due  to  link  failure,  onset  of  congestion, 
new  traffic,  etc.  We  auume  that  tome  mechanism  exhtt  for 
making  adjustments,  and  that  these  will  be  made  relatively 
infrequently  compared  to  the  rate  of  second  level  adaptivity. 
We  will  assume  in  this  paper  that  the  first  level  of  adaptation 
does  not  change  over  the  period  of  interest. 

The  first  level  consists  of  assigning  a  set  of  "good”  paths  to 
each  commodity  or  node  pair  that  wish  to  cominiinicaie.  In¬ 
stead  of  finding  the  “best”  patli,  we  took  for  as  large  a  sci  as 
possible  of  paths  that  are  good  in  some  sense  -a  small  number 


of  hops,  small  congestion,  etc.  (A  similar  approach  was  pre¬ 
sented  in  (S|.)  We  do  not  force  ourselves  at  this  level  to 
choose  among  alternative  paths  that  are  of  the  ume  or  similar 
quality.  That  is  done  in  the  second  level.  The  second  level  per¬ 
forms  better  if  there  are  more  alternative  paths.  There  is  an 
inherent  tradeoff  here,  since  choosing  longer  paths  places 
more  traffic  on  the  network.  We  assume  that  a  selection  of 
paths  is  possible  and,  although  we  do  not  investipte  It  here, 
that  it  can  be  made  slowly  adaptive. 

As  a  result  of  the  first  level  procedure,  there  will  be  nodes 
where  the  paths  for  a  particular  commodity  Intersect,  including 
the  originating  node.  Packets  at  such  a  node  have  a  choice  of 
which  outgoing  link  they  can  use  (two  or  more  links  will  be 
in  the  various  allowable  paths).  There  are  certain  links  which 
may  not  be  used  by  a  particular  commodity— they  lead  to  bad 
paths.  We  do  not  force  the  choice  on  the  packet  upon  arrival 
at  the  node,  but  operate  the  node  u  a  (constrained)  multiple 
server  facility.  If  all  packets  could  choose  any  of  the  links, 
then  it  would  indeed  be  a  multiserver  facility  and  we  would 
get  the  k-fold  improvement  in  delay.  We  will  show  that  most 
of  this  improvement  is  still  obtained,  even  when  some  serven 
cannot  service  some  commodities. 

This  two4evel  adaptive  procedure  will  then  give  us  the  de¬ 
sired  improvement  in  time  dday  while  maintaining  good  paths. 

V.  ADAPTIVE  NODE  MODEL 

Pint,  as  shown  in  Rg.  3,  consider  a  node  with  two  outgoing 
linka  (e  and  b).  Some  commodities  have  assigned  paths  that 
only  uie  one.of  these  links.  They  have  no  choke  and  wiO  be 
routed  directly  to  a  dedicated  queue  which  can  only  be  served 
by  one  of  the  servers.  Denote  the  total  rate  of  tralBc  Joining 
each  of  the  queues  u  X,  and  X».  Other  commodities  (of  total 
rate  X,)  have  paths  which  use  either  link  e  or  b..  We  assign 
them  to  a  third  queue  (c).  Server  e,  for  example,  may  choose 
to  serve  a  packet  from  a  or  c,  but  not  b.  We  will  discuss  dif¬ 
ferent  strategies  for  the  server. 

The  only  way  this  constrained  multiple  server  queueing 
system  differs  from  Af/Af/2  behavior  Js  v^en  queues  a  and  c 
are  empty,  for  example,  whUe  queue  b  is  not.  One  of  the 
serven  (a)  is  idle  while  the  node  still  has  tome  work.  To  reduce 
the  possibility  of  this  happening,  we  could  keep  queue  e  as 
large  as  possible.  Thus,  an  excellent  strategy,  which  appean  to 
be  optimum,  it  to  give  priority  to  the  dedicated  queues,  serving 
queue  c  only  when  one  of  the  dedicated  queues  it  empty. 
Another  strategy  it  to  allow  packets  from  the  stream  X^  to 
join  the  shorter  of  the  two  queues  (thus  eliminating  the  need 
for  the  third  queue).  We  have  [4]  evaluated  these  and  other 
strategies  for  this  node  model  and  more  complex  node  models, 
and  have  found  the  performance  of  the  node  at  measured  by 
average  time  delay  to  be  fairly  insensitive  to  the  strategy 
adopted  by  the  servers.  Although  different  classes  of  traffic 
may  suffer  radically  different  time  delays,  the  average  delay  of 
all  traffic  in  a  node  remains  roughly  the  tame  for  many  good 
strategies.  We  will  consider  below  only  two  strategies— the  one 
which  gives  priority  lu  the  dedicated  queues  and  the  one  in 
wliich  arrivuls  join  the  shortest  queue. 

In  Fig,  4  we  present  the  average  queue  length  f(AO  versus 
uiili/.ation  p  tor  a  two-server  nude  under  various  traffic  pal- 
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server  (link)  j 


server  (Hnk)  b 

Fig.  3.  A  node  with  two  servers. 
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Fig.  4.  Estimated  average  number  of  packets  in  a  node  with  two  links. 

Simulation  results. 

terns.  We  assume  that  the  two  link  capacities  are  equal  to  C, 
X*  =  Xft,  p,  =  \J^C,  Pc  =  Xc/pC,  p  =  2Pc  +  Pc-  The  lower 
bound  is  for  an  M/MI2  queue  or  all  nondedicated  traffic  (X,  = 
0).  The  upper  bound  is  for  all  dedicated  traffic  (Xc  =  0)  and 
represents  two  M/Ml  I  queues.  In  general,  all  of  our  results  lie 
between  those  two  bounds  of  performance.  The  circled  points 
are  simulation  results  for  different  traffic  mixes  and  the  strategy 
that  gives  priority  to  dedicated  queues.  From  this  figure  and 
from  similar  results  for  three-server  nodes  and  different  traf¬ 
fic  patterns  (as  discussed  below  and  in  Fig.  5),  the  following 
rule  of  thumb  has  emerged.  As  long  as  15-20  percent  of  the 
traffic  is  not  dedicated  and  the  choice  of  traffic  is  not  appor¬ 
tioned  to  disjoint  sets  of  servers,  then  the  node  behavior  will 
achieve  more  than  half  of  tlie  improvement  of  an  M/M/k 
queue.  It  has  been  shown  [7] ,  [8]  that  in  the  limit  of  heavy 
traffic  (high  utilization )  a  "join  the  shortest  queue”  policy  tends 
towards  the  performance  of  an  M/M/k  queue.  This  is  true  even 
if  some  of  tlie  arrivals  cannot  join  some  of  tlie  queues,  but 
there  must  be  some  linkage  between  all  the  queues.  This  is  just 
our  node  model.  Since  we  believe  that  our  priority  strategy  is 
better  than  a  join  the  shortest  queue  policy,  we  have  that  in 
the  limit  of  heavy  lufCic,  M/M/k  behavior  is  achieved 

The  above  remarks  hold  for  nodes  with  any  number  of 
links.  Assume  that  a  node  has  k  links.  Then  there  are  2*  -  1 
classes  of  input  traffic  -X  classes  of  dedicated  traffic,  one  to 
each  server;  k(k  —  l)/2  classes  of  traffic  assigned  to  specific 


pairs  of  servers;  '  ;  and  finally  one  class  of  traffic  that  can  use 
all  servers  (links).  Many  of  these  classes  may  be  empty.  But 
if  they  span  all  servers,  and  if  the  total  traffic  that  is  not 
dedicated  is  roughly  15-20  percent  of  the  total  traffic,  then 
nodal  delay  is  reduced  by  a  factor  of  order  k  over  a  single 
queue  model.  If  the  traffic  having  choice  is  too  light  or  if 
the  servers  are  in  isolated  groups,  then  this  advantage  is  re¬ 
duced. 

In  Fig.  5  the  normalized  time  delay  (in  units  of  service  time 
or  message  length)  is  given  as  a  function  of  the  amount  of  traf¬ 
fic  having  choice  at  a  three-server  node  and  for  various  utiliza¬ 
tions.  These  results  were  obtained  by  nodal  simulations.  The 
left  limit  of  the  graph  represents  M/M/l  or  nonadaptive  be¬ 
havior.  The  right  limit  represents  M/Af/3  or  the  limiting  adapt¬ 
ive  behavior.  There  are  two  types  of  traffic  that  have  choice 
at  a  three-server  node.  One  type  may  be  served  by  any  of  the 
three  servers.  The  other  type  may  be  served  by  only  two 
servers-there  are  three  different  members  of  this  type.  A  va¬ 
riety  of  different  combinations  were  simulated.  Tlie  resulting 
time  delay  showed  only  slight  variations  for  different  traffic 
patterns  and  depended  only  upon  the  total  traffic  havingchoice. 
It  is  seen  from  Fig.  5  that  if  15-20  percent  of  the  traffic  has 
choice,  then  the  behavior- is  significantly  closer  toAf/Af/3  than 
to  M/M/l. 

For  high  utilizations,  the  time  delay  is  dominated  by  the 
waiting  time  and  the  maximum  improvement  expected  is  by  a 
factor  equal  to  the  degree  of  tlie  node.  For  moderate  utiliza 
tions,  M/M/k  operation  would  result  in  even  greater  relative 
savings. 

We  have  been  able  to  develop  upper  bounds  on  node  per¬ 
formance  that  exliibit  some  of  the  above  behavior.  For  a  more 
accurate  understanding,  we  have  resorted  lo  the  simulation  re¬ 
sults  already  described. 

Consider  the  two-server  node  of  Fig.  3  and  assume  tJiat  die 
dedicated  queues  have  pnority.  Furihcrmore,  let  tlie  capacities 
C  =  1  for  convenience.  Then  we  can  derive  die  following  e\ 
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pressions; 

=  (3) 

=  (4) 

HE(W,)<1  Pg+i  KE{W,)  +  min  {X,(£W) +  £'(iV^)) }. 

i^a.b 

(5) 

In  these  equations  Wj  is  the  waiting  time  of  queue  i,  is 
iu  expectation,  p,  =  (X,  +  aX^Vp,  pj  =  (X^  +  (1  —  a)X<.J/p, 
and  a  is  the  proportion  of  X^  traffic  served  by  server  1 .  £5  is 
again  the  probability  that  the  system  is  busy.  Equation  (3)  is 
derived  as  follows.  An  arrival  to  queue  a  must  wait  for  server  1 
to  flnish  service,  if  it  is  occupied,  and  then  wait  for  all  mes¬ 
sages  that  it  found  in  queue  a  to  be  served.  Recall  that  queue  a 
has  priority.  The  probability  of  server  1  being  busy  is  Pt ,  there 
are  Xa£(h'a)  messages  already  on  queue  a  on  the  average,  and 
each  message  has  an  average  service  time  of  1/p. 

Equation  (S)  is  more  difficult  to  derive.  An  arrival  to  queue 
c  waits  for  the  next  completion  of  service  if  the  system  is 
busy.  The  probability  of  this  is  £3  and  the  average  time  to  the 
next  completion  is  l/2p.  It  must  also  wait  for  all  the  messages 
that  it  found  on  queue  c  when  it  arrived.  Again,  the  average 
number  of  these  is  X^(lt'f)  and  each  goes  into  service,  when 
allowed,  with  a  rate  2p.  But  queue  c  does  not  have  priority. 
Thus,  it  must  also  wait  for  the  first  queue  to  put  into  service 
all  messages  found  in  that  queue  upon  arrival,  Xj£(Ii'j)  on  the 
average,  and  for  all  other  messages  that  arrived  to  that  queue 
while  the  message  on  queue  c  was  waiting  — Xf£(h'f ),  on  the 
average.  The  last  term  in  (S)  should  be  the  expectation  of  the 
minimum  of  two  random  variables— the  total  number  of  mes¬ 
sages  in  queues  a  or  6  while  our  message  waits  in  queue  c.  We 
upper  bound  this  with  the  minimum  of  the  two  expectations. 

If  we  let  X,  =  Xft,  then  a  =  1/2.  Furthermore,  we  upper 
bound  Pg  by  =  Pi  •  Equations  (3>-(5)  can  now  be  solved  to 
obtain  for  the  average  waiting  time  for  all  messages; 

p£(H0<7-^(l (6) 
1  -p 

where  ^  =  X^/fX,  +  X#  +  X^),  the  fraction  of  traffic  having 
choice.  Note  that  for  A//A//1  behavior,  p£(W)  =  p/(l  -  p), 
and  forA//A//2  behavior,p£(l40  =  (p/(l  --p)XP/(i  +p)).As0-» 
0,  our  upper  bound  converges  to  A//Af/1  behavior.  As  0  -*  I, 
our  upper  bound  yields  a  reduction  of  1/2,  where  M/Mfl  be¬ 
havior  has  a  better  improvement  of  p/(l  -(■  p).  Our  upper  bound 
on  Pg  is  causing  the  discrepancy  here.  In  any  event,  we  note 
the  linear  behavior  with  0  in  (6). 

A  similar  expression,  with  similar  results,  can  be  derived  for 
a  three-server  node.  Let  the  arrival  rate  to  the  dedicated 
queues  be  equal,  and  assume  only  one  other  class  of  traffic 
which  can  choose  between  all  three  servers.  Tlien  we  obtain 

p£(H')  =  — (1 <’) 
1  -P 

where  0  has  the  same  meaning  as  above.  Here,  as  0  -  I  ,p/'.(  W') 


is  a  third  of  that  obtained  with  M/M  1 1  queues,  whereas  an 
M/Mji  queue  would  have  an  even  smaller  waiting  tune 

VI.  NETWORK  PERFORMANCE 

In  general,  we  can  say  that  if  we  adopt  this  adaptive  routing 
procedure,  and  if  at  each  node  there  is  a  sufficient  amount  of 
nondedicated  traffic,  then  at  high  utilization  the  time  delay  at 
each  node  is  reduced  by  almost  the  degree  of  that  node  from 
the  time  delay  using  nonadaptive  routing.  ITie  effect  on  the 
network  will  be  to  reduce  the  time  delay  by  a  factor  on  the 
order  of  the  average  degree  of  the  m  Jes.  For  moderate  utiliza¬ 
tions,  a  large  part  of  that  advantage  will  still  be  achieved.  As¬ 
sume,  for  simplicity,  that  using  nonadaptive  routing,  all  links 
have  the  same  average  time  delay  T^ .  Tlien  T  =  ITx-  If  our 
adaptive  strategy  is  used,  then  £(„)  =  l/k(„)7'i  where  k(„)  is 
the  degree  of  node  n.  For  this  adaptive  routing,  the  network 
delay  becomes  at  best  T^  lTi  *  Z„(X(„)/A:(„))/X.  If  all/C(„)  = 
k,  then  the  reduction  is  of  the  order  of  k. 

The  accuracy  of  the  approximate  performance  at  a  node 
improves  as  more  of  the  traffic  has  choice  and  as  the  utiliza¬ 
tion  increases.  These  results  serve  as  a  quick  estimate  of  the 
benefits  obtainable  from  adaptive  routing  and  are  useful  as  a 
design  tool. 

Vlll.  EXAMPLES 

We  have  evaluated  many  examples  of  networks  [4] .  Most 
of  these  had  some  symmetry  and  a  modest  number  of  nodes 
(at  most  20).  Our  evaluation  procedure  was  as  follows.  We  first 
obtained  the  optimum  nonadaptive  routing  and  evaluated  its 
performance.  In  most  of  the  examples,  this  resulted  in  using 
shortest  paths.  We  then  allowed  the  adaptive  routing  to  use 
all  shortest  paths  for  each  commodity.  We  then  found  the 
amount  of  traffic  that  had  choice  at  each  node.  For  a  wide 
variety  of  situations,  there  was  sufficient  nondedicated  traf¬ 
fic  at  each  of  the  nodes  to  justify  our  approximation. 

There  are  two  steps  in  a  careful  evaluation— the  computa¬ 
tion  of  the  amount  of  traffic  of  each  class  (with  respect  to  the 
nature  of  choice)  in  each  node  and  the  evaluation  of  node  per¬ 
formance.  We  have  already  discussed  good  estimates  for  the 
latter.  The  former  step  can  be  quite  complex,  since  it  depends 
upon  how  traffic  is  split  at  each  node  at  which  it  has  choice. 
We  have  developed  several  approximate  techniques  to  perform 
this  step  [4] .  For  example,  consider  Fig.  3  again.  Let  ctbe  the 
fraction  of  X^  traffic  that  is  served  by  a.  The  total  utilization 
of  server  a  is  (X*  +  oX, )/(j  =  p, .  Similarly,  define  =  [Xj,  -t- 
(1  -  Queue  a,  including  the  server,  is  empty  with 

probability  1  —  P| .  A  message  on  queue  c  wilJ  be  seived 
by  a  only  if  the  entire  queue  is  empty.  Tlius,  approximate  a  by 
o  =  (1  -  Pi)/(2  —  Pi  -  P2).  Tliis  can  be  solved  for  a  to  obtain 
a-(i-  Pa)/(2  -  Pa-  Pi,)  where  =  K/P-Pb  =  >^b/P- This 
approximation  has  been  verified  by  simulation  [4] . 

We  summarize  one  example  of  a  six-node  network  shown  in 
Fig  6.  Had',  of  the  nodes  has  degree  3.  We  allow  only  shortest 
paths  (level  one  of  our  auantivc  pioccduro).  Ihus,  all  traffic 
from  1  to  2  or  4  01  0  ;s  n.iiu'd  d!r:i.tl\  .  but  traffic  trom  1  to  3 
or  3  has  a  choi  ;e  o*'  thice  two  hop  paih.s.  We  assume  that  all 
links  have  the  same  c.ip.ici's,  pC  "  iU,  and  all  7,,  =  r  It  is 
fairly  ea.sy  to  see  that  >  =  .'di  /  7/5,  and  X  -  A2r  A  non 
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adaptive  optimum  routing  can  be  found  to  that  the  flows  in 
each  link  are  identical,  X|  »  7/3r.  Thus,  for  nonadaptive 
routing,  T “  1 .4/(10 -  7/3r).  Air-* 30/7  « 4.29,  the  adaptive 
routing  will  give  an  advantage  of  a  factor  of  3. 

We  now  compare  the  optimum  nonadaptive  routing  with 
our  adaptive  routing.  For  nonadaptive  routing,  we  used  the 
above  results  for  single  server  queues.  For  the  adaptive  routing, 
we  used  the  results  of  nodal  simulations  for  three-server  nodes 
and  T  *  lT^n)  a*  discussed  in  Section  VI.  For  r  ■  1 .84.  />  > 
0.43,  the  nonadaptive  routing  had  a  time  delay  of  0.245, 
while  the  adaptive  routing  had  a  time  delay  of  0.197.  For  r  ■ 
3X)6,  p  0.71 ,  the  results  were  0.490  for  nonadaptive  routing 
and  0.29S  for  adaptive  routing.  Thus,  we  have  reduced  the 
time  delay  by  a  factor  of  1 .66  in  the  latter  case.  In  this  ex¬ 
ample,  the  amount  of  traffic  having  choice  at  each  node  was 
2/7  of  the  total  or  29  percent.  If  full  multiple  server  perform¬ 
ance  was  attainable,  the  time  delay  would  have  been  0.217. 
Thus,  70  percent  of  this  improvement  was  attaitMd.  The 
fanprovement  increases  if  more  traffic  hat  choke.  As  the  utili¬ 
sation  increases,  the  full  benefit  of  multiple  server  perform¬ 
ance  is  achieved.  The  comparison  can  also  be  made  with  respect 
to  waiting  tiitMS  tfaice  the  aervioe  timet  are  constant  and  can¬ 
not  be  reduced.  Here  the  service  time  for  the  network  is  0.i4. 
Thus,  for  43  percent  utilization,  the  waiting  time  has  been 
reduced  from  O.IOS  to  0.0S7  by  use  of  adaptive  routing— a 
factor  of  1 J84.  At  71  percent  uti^tion,  the  reduction  is  from 
0  JSO  to  O.lSS-a  factor  of  2.26. 

A  number  of  other  examples  with  as  many  u  20  nodes  have 
been  investigated  [4] .  Mott  had  some  amount  of  symmetry, 
but  not  nearly  as  much  u  the  example  dted  here.  One  12- 
node  example  was  generated  arbitrarily.  In  all  cases,  the  above 
procedure  was  followed.  The  optimum  nonadaptive  routing 
was  found.  In  almost  all  of  the  examples,  shortest  paths  were 
used.  The  adaptive  routing  was  created  by  giving  a  choice 
among  all  shortest  paths  for  each  commodity.  There  was  suf¬ 
ficient  choice  at  most  nodes. 

The  effects  of  asymmetry— of  the  topology,  the  traffic 
matrix,  and  link  capacities-have  not  yet  been  adequately 
studied.  For  very  small  networks  or  pathological  cases,  longer 
paths  may  have  to  be  used  by  the  adaptive  strategy  to  achieve 
sufficient  choice  at  some  nodes.  This  will  increase  the  utiliza¬ 
tion  and  reduce,  and  perhaps  eliminate,  the  performance 
advantage.  However,  the  performance  advantage  seems  signifi¬ 
cant  enough  to  leave  some  margin  for  this  tradeoff. 

Asymmetries  in  link  capacities  and  traffic  patterns  create 
another  problem.  Tlie  second  level  of  adaptive  routing  is  based 
on  local  information  only.  This  may  lead  to  saturation  at  other 
nodes.  The  first  level  of  the  adaptive  routing,  as  described 
above,  may  not  have  enough  flexibility  to  circumvent  this.  We 


have  begun  to  study  a  variation  that  si.ows  some  promise.  Con¬ 
sider  traffic  between  a  paii  of  ntidcs  that  has  two  or  more 
good  paths.  Pieviously  we  as.sum^d  i.'ui  all  that  traffic  was  to 
be  given  choice.  Here  we  dedicate  some  of  that  traffic  to  speci¬ 
fic  paths  in  order  to  balance  the  load.  For  example,  we  may 
wish  to  produce  a  flow  distribution  si.nitlar  to  that  used  in  the 
nonadaptive  routing  scheme.  The  remainder  of  the  traffic  is 
allowed  to  have  choice  as  before.  Since  the  amount  of  traffic 
having  choice  at  a  node  is  not  critical,  it  is  expected  that  it  will 
still  be  possible  to  achieve  a  performance  advantage.  Several 
simple  examples  that  have  been  tried  have  been  encouraging. 

A  brief  report  on  two  further  examples  illustrates  these 
ideas.  Details  can  be  found  in  [4] .  Fig.  7  shows  an  asymmetri¬ 
cal  eight-node  network.  We  assume  equal  channel  capacities 
and  equal  requirements  between  pairs  of  nodes.  Four  of  the 
nodes  have  two  serven:  the  other  four  have  three  servers. 
Hence,  we  expect  the  maximum  improvement  to  be  by  a  factor 
between  2  and  3.  The  optimum  nonadaptive  routing  uses 
shortest  paths.  The  adaptive  routing  allows  choice  among  these 
shortest  paths,  thus  retaining  the  ume  utilization.  The  most 
utilized  links  were  the  two  connecting  cluturs  of  four  nodes. 
When  these  were  80  percent  utilized,  the  adaptive  routing 
scheme  outperformed  the  optimum  nonadaptive  scheme  by  a 
factor  of  1 .6  for  time  delay  and  2.6  for  waiting  time.  When  the 
utilization  was  90  percent,  these  factors  were  2.2  and  almost 
3,  respectively. 

Consider  another  asymmetrical  eight-node  network,  shown 
in  Fig.  8.  Here  we  assumed  an  asymmetrical  traffic  matrix.  If 
only  shortest  paths  were  allowed,  little  traffic  has  choice,  a 
nonadaptive  and  our  adaptive  scheme  are  almost  identical,and 
the  time  delay  is  6.24  for  both.  The  optimum  nonadaptive 
juting  had  a  time  delay  of  4.80.  An  adaptive  routing  can  be 
found  using  the  same  length  paths  for  each  node-pair  traffic. 
This  resulted  in  a  time  delay  of  4j08-a  IS  percent  improve¬ 
ment.  There  was  not  enough  choice  to  get  more  improved 
performance  in  the  nodes.  When  more  choice  was  allowed, 
increasing  the  utilization,  the  time  delay  increased  to  4.62. 
The  service  time  here  was  equal  to  2.  Hence,  the  waiting  time 
for  shortest  paths  wu  A2A,  the  optimum  nonadaptive  routing 
had  a  waiting  time  of  2.80,  and  the  adaptive  routing  with 
equal  length  paths  had  a  waiting  time  of  2.08-a  25  percent 
improvement. 

More  study  is  needed  to  fully  understand  the  influence  of 
these  uymmetries  on  this  type  of  adaptive  routing. 

VIII.  CONCLUSIONS  AND  FURTHER  WORK 

We  have  presented  a  two-level  adaptive  routing  procedure 
that  exhibits  marked  improvement  in  performance  over  non¬ 
adaptive  routing.  Moreover,  we  have  presented  some  analytic 
approximations  to  estimate  this  improvement.  In  heavy 
traffle,  we  have  shown  the  improvement  to  be  by  a  factor 
equal  to  the  nodal  degree  (or  average  nodal  degree).  In  general, 
for  moderate  utilization,  a  large  part  of  this  improvement  is 
still  achieved. 

A  more  elaborate  simulation  of  a  network  is  underway.  In 
that  work,  the  independence  assumption  is  not  made.  Al¬ 
though  it  confirms  our  model,  it  has  shed  some  light  on  the 
“independence”  assumption.  We  found  that  at  high  uiilira 
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I  IK  7  An  eight-node  asymmetricil  network. 


Fig  8.  An  eight-node  asymmetrical  network  used  with  an  asymmetrical 
trafric  matrix. 
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tion  we  were  getting  most,  but  not  all,  of  the  expected  im¬ 
provement.  Many  different  nodal  strategies  had  only  a  modest 
effect  on  resolving  this  gap.  We  found  that  it  was  easy  to 
generate  strategies  that  produced  multiple  server  behavior.  But 
some  of  them  had  a  profound  effect  on  the  output  process 
from  the  server,  and  thus  affected  the  Poisson  assumption  for 
inputs  to  the  next  node.  The  priority  scheme  was  notorious  in 
this  regard  Usually  any  traffic  one  hop  away  from  its  destina¬ 
tion  will  be  placed  in  a  dedicated  queue.  Thus,  only  continuing 
traffic  will  be  found  in  nondedicated  queues.  These  have 
lower  priority  and  will  be  transmitted  only  when  the  dedicated 
queue  is  empty.  But  then  a  burst  of  several  packets  is  likely  to 
be  sent.  A  bursty  arrival  process  results  in  poorer  queue  per¬ 
formance  than  a  Poisson  process.  This  contributed  signifi¬ 
cantly  to  the  delay  performance  gap.  The  design  of  a  nodal 
strategy  must  consider  not  only  multiserver  performance 
achievement  at  a  node,  but  the  resultant  output  process  as 
well.  This  is  the  direction  of  our  current  research. 

The  usual  way  such  a  burst  could  occur  is  when  the  packets 
are  short.  But  these  should  have  made  the  next  node’s  job 
easier  Unfortunately,  the  second  aspect  of  the  independence 
assumption  now  comes  into  play— packet  lengths  are  reassigned 
at  each  node.  This  accentuates  the  effect  of  burstiness.  W'hen 
the  simulation  was  altered  to  take  into  account  dependence 
between  nodes,  hence  dispensing  with  the  independence  as¬ 
sumption,  the  difficulty  disappeared. 
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ABSTRACT 

A  dynamic  routing  scheme  Is  studied. 
Instead  of  assigning  fixed  paths  or 
randomly  selecting  paths,  permissible 
paths  (e.g.,  minimum  hop  paths)  are 
pre-asslgned  to  each  node  pair.  A 
packet  arriving  at  a  node,  and  destined 
for  another  node.  Is  assigned  to  one 
of  two  types  of  queues  -  a  dedicated 
one  or  a  shared  one.  The  objective  of 
the  routing  scheme  Is  to  reduce  both 
nodal  as  well  as  network  average  packet 
delay.  We  have  conducted  a  aeries  of 
simulation  studies  to  evaluate  the 
nodal  as  well  as  the  network  delay 
under  the  proposed  dynamic  routing 
scheme.  Several  Important  and  encouraging 
results  are  obtained  and  are  presented 
In  this  paper. 


1.  INTRODUCTION 

A  dynamic  routing  scheme  has  been 
previously  proposed  (References  [1],  [2], 
L3],.  and  [4]).  Instead  of  assigning 
fixed  paths  or  randomly  selecting  paths, 
permissible  paths  (e.g.,  minimum  hop 
paths)  are  pre-asslgned  to  each  node  pair. 
A  packet  arriving  at  a  node,  and  destined 
for  another  node.  Is  assigned  to  one  of 
two  types  of  queues.  A  dedicated  queue 
Is  a  queue  whose  packets  are  served  by 
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one  and  only  one  dedicated  server.  A 
shared  queue  Is  a  queue  whose  packets  may 
be  served  by  two  or  more  specified 
servers.  A  server  Is  equivalent  to  an 
output  link.  The  purpose  of  pre¬ 
selecting  paths  la  to  avoid  long  paths 
and  resultant  Increased  network  traffic 
that  would  occur  In  a  random  routing 
scheme.  The  purpose  of  the  shared  queue 
Is  to  let  a  portion  of  the  nodal  traffic 
experience  the  delay  of  a  multiple 
server  queue  and  therefore  reduce  the 
average  packet  delay  at  a  node.  The 
objective  of  the  model  Is  therefore  to 
reduce  both  nodal  as  well  as  network 
average  packet  delay. 

We  have  conducted  a  series  of  simulation 
studies  to  evaluate  the  nodal  as  well  as 
the  network  delay  under  the  proposed 
dynamic  routing  scheme.  A  more  detailed 
report  can  be  found  In  Reference  [5]. 
Section  2  describes  the  nodal  and  net¬ 
work  models  as  well  as  the  different 
service  disciplines  used.  Section  3 
gives  the  results  of  nodal  and  network 
delay  performance.  A  summary  Is  given 
In  Section  4 . 

2.  THE  MODEL 

2.1  The  Model  Model 

The  traffic  from  each  pair  Is  preassigned 
to  one  or  more  pre-selected  paths  (and 
therefore  server(s)  at  a  node).  Two 
types  of  queue  can  be  contructed  at  a 
node.  A  dedicated  queue  can  only  be 
served  by  one  dedicated  server.  There 
are  at  most  n  such  queues  at  a  node  with 
n  links.  A  shared  queue  can  be  served 
by  any  of  several  preassigned  servers. 
There  could  be  as  many  as  2J1  -  n  -  1 
shared  queues  at  a  node.  A  two  server, 
three  queue  nodal  model  Is  shown  In 
Figure  1. 
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FIGURE  1  A  NODAL  QUEUEING  MODEL  WITH  TWO 
SERVERS 


In  Figure  1,  traffic  arrives  at  the 
different  queues  with  arrival  rates 

(1  ■  1,2,3)  packets/sec.  Queue  1  and 

queue  3  are  competing  with  server  1  for 
service  while  queue  2  and  queue  3  are 
competing  for  service  by  server  2.  The 
service  rate  Is  u  packets/sec.  If  one 
of  the  competing  queues  (e.g.,  queue  1 
or  queue  3)  la  empty  (e.g.,  queue  1), 
and  the  server  for  these  queues  Is  idle, 
a  packet  in  the  non-empty  queue  Is 
served.  If  both  queues  are  not  empty 
when  the  server  has  Just  completed 
service,  a  pre-deflned  service 
discipline  will  determine  which  queue 
to  serve  next.  When  both  competing 
queues  are  empty  (e.g.,  queue  1  and 
queue  3)  and  the  server  for  these  queues 
Is  Idle,  the  server  will  stay  Idle  until 
there  Is  a  new  arrival  to  one  or  both  of 
these  queues.  The  Idle  server  will  not 
serve  the  queue  which  Is  not  pre- 
asslgned  to  It  for  service.  This  is  the 
major  difference  between  our  nodal  model 
and  a  multiple  server  queueing  system. 
The  operation  of  a  node  with  more  commu- 
catlons  links  Is  basically  the  same  as 
that  of  a  two  link  node.  Shared  queues 
will  be  served  by  a  subset  of  the 
servers. 


2 . 2  Service  Disciplines 

The  nodal  model  described  In  the  previous 
section  allows  different  types  of  queues 
to  compete  for  service.  A  service 
discipline  must  therefore  be  specified. 

We  will  describe  four  different  service 
disciplines  for  the  nodal  model  of 
Figure  1. 

1.  Priority  Service  Discipline  (PSD) 

A  server  always  searches  the 
dedicated  queue  for  service  prior 
to  Its  service  of  the  shared  queue. 


For  example,  server  1  In  Fl^rurc  1, 
after  conipletlns;  a  service,  will  ai^ 
ways  lock  to  queue  1  for  moi-o 
sei’vlce.  ['M'Kets  In  queue  3  will  be 
served  by  serve  1  only  If  queue  1  Is 
empty  when  server  1  has  ,lusi 
completed  a  service.  Therefore, 
server  1  Is  busy  as  long  as  there 
are  packets  to  be  served  In  queue 
1  or  queue  3.  Conversely,  server 

1  Is  Idle  and  stays  Idle  when  both 
queue  1  and  queue  3  are  empty.  The 
Instantaneous  queue  length  of  queue 

2  does  not  effect  any  of  the 
decision  rules  at  server  1 
described  above.  A  packet  entering 
the  shared  queue  when  both  servers 
are  Idle  Is  served  randomly  by 
either  server  1  or  server  2. 

2.  Alternate  Service  Discipline  (ASD) 

A  server  at  the  node  In  Figure  1, 
whenever  possible,  alternatively 
serves  n  packets  from  the  dedicated 
queue  and  m  consecutive  ones  from 
the  shared  queue.  The  server  Is 
forced  to  serve  more  than  n  (or  m) 
packets  from  the  dedicated  (or  the 
shared)  queue  If  the  other  queue 
happens  to  be  empty.  Under  this 
situation,  alternation  of  service 
will  begin  whenever  there  Is  a  new 
arrival  to  the  empty  queue.  The 
process  Is  started  over  again  at 
this  point. 

3.  Serve  The  Longer  Queue  Discipline 
(SLQD) 

The  queue  with  the  longer  queue 
among  the  two  competing  queues 
(the  shared  and  the  dedicated  queues 
gets  service  first.  A  server, 
upon  a  service  completion  compares 
the  queue  lengths  of  the  two 
competing  queues,  and  serves  a 
packet  from  the  longer  queue.  The 
SLQD  tries  to  equalize  all  the 
queue  lengths  at  a  node. 

4.  Random  Service  Discipline  (RSD) 

A  server  (e.g.,  server  1),  upon 
completing  a  service,  serves 
packets  from  one  of  two  competing 
queues  according  to  the  following 
algorithm:  It  serves  queue  1  If 
queue  3  Is  empty  and  vise  versa; 

It  randomly  selects  a  queue  If 
both  queue  1  and  queue  3  are  non¬ 
empty.  This  Is  a  totally 
"uncontrolled"  service  discipline. 

2.3  The  Network  Model 

In  order  to  describe  the  dynamics  of  a 

packet  switching  employing  the 

proposed  dynamic  routing  scheme,  one 


! 
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must  (1)  construct  nodal  queueing 
models  similar  to  the  one  In  Figure  1 
for  every  node  In  the  network,  (2)  pre¬ 
select  a  set  of  "permissible"  paths 
for  each  source-destination  node  pair, 
and  (3)  assign  the'data  flow  according 
to  the  preselected  paths  between 
adjacent  nodes.  In  order  to  simplify 
description  of  the  network  model,  we 
have  chosen  a  four  network  to  Illustrate 
the  dynamics  of  our  network  model. 

A  four  node  network  Is  shown  In  Figure 
2.  Every  node  is  connected  to  two  links, 
l.e.,  is  of  degree  two.  Therefore,  the 
nodal  queueing  model  at  each  node  Is 
exactly  the  same  as  that  of  the  two 
server  three  queue  model  described 
previously.  The  permissible  paths 
between  sources  and  destinations  are 
selected  based  on  a  shortest  path 
(minimum  hop)  algorithm.  Adjacent  nodes 
will  only  use  the  one  hop  path  between 
them.  Non-adjacent  nodes  will  use  either 
one  of  the  two-hop  paths.  Therefore, 
traffic  between  adjacent  nodes  Is  pre¬ 
assigned  to  the  dedicated  queues. 

Traffic  between  non-adJacent  nodes,  on  the 
other  hand.  Is  pre-asslgned  to  the  shared 
queue  at  the  source  node  and  the 
appropriate  dedicated  queue  at  the 
Intermediate  node. 


FIGURE  2  A  FOUR  NODE  NETWORK 


Each  server  at  the  node  serves  the  two 
competing  queues  (the  dedicated  queue 
and  the  shared  queue)  according  to  a 
pre-speclfled  discipline.  We  have 
simulated  the  network  for  the  four  service 
disciplines  discussed  previously.  A 
service  discipline  where  the  condition  of 
the  next  node  is  "known"  to  all  servers 
at  the  current  node  and  used  to 
determine  routing  Is  also  studied. 

3.  RESULTS 

Simulation  programs  In  PL/1  have  been 
written  to  evaluate  the  delay  of  the 
nodal  model  of  Figure  1  and  the  network 
model  of  Figure  2  operating  under  the 


different,  service  disciplines.  All  simu¬ 
lation  results  are  based  on  statistics 
collected  over  a  sufficient  time  Interval 
so  as  to  represent  the  steady  state. 

The  statistics  collected  for  the  PSD 
scheme  are  based  on  at  least  one  million 
(sometimes  four  million)  simulation 
events  -  Including  both  arrivals  and 
departures.  In  order  to  conserve 
computing  time,  the  statistics  collected 
for  other  service  disciplines  are  based 
on  at  least  two  hundred  thousand 
simulation  events.  In  our  simulation 
program,  all  arrival  processes  to 
source  nodes  are  assumed  to  be  Poisson 
and  all  service  distributions  are 
assumed  to  be  exponential  and  Independ¬ 
ent  of  each  other.  No  assumption  Is 
made  about  arrivals  processes  to  Inter¬ 
mediate  nodes  In  the  network  model. 

3.1  Model  Delay  Performance 

The  percentage  of  the  traffic  that  Is 
shared  at  a  node  Is  an  Important 
parameter  affecting  the  nodal  delay. 

We  have  studied  nodal  delay  based  on 
three  sets  of  nodal  traffic  mixes. 

They  are  low  (approximately  10  percent 
to  15  percent),  moderate  (25  percent), 
and  high  (50  percent)  percentages  for 
the  shared  traffic  at  a  node  respectively. 
Fairly  extensive  simulation  runs  were 
conducted  throughout  the  whole  range 
of  nodal  utilization  (0  to  1)  for  the 
PSD  and  a  moderate  percentage  of 
shared  traffic.  We  assumed  that  the  Input 
processes  to  the  node  were  Poisson. 

3.1.1  Simulation  Results  Of  The 
l>riorlty  Service  PlsclpTTne 

The  effects  of  the  percentage  of  the 
traffic  that  is  shared  on  nodal  aelay 
are  tabulated  In  Table  1.  Notice  that 
the  higher  the  percentage,  the  closer 
the  nodal  delay  Is  to  that  of  an 
M/M/2  queueing  system. 

Figure  3  plots  the  delay-throughput 
relationship  of  the  PSD  for  different 
percentages  of  shared  traffic.  T.he 
delay  for  the  M/M/2  and  the  M/M/1 
queueing  systems  are  also  plotted  for 
reference. 

In  Figure  a  measure  of  the  "close¬ 
ness"  of  the  PSD  to  M/M/2  queueing  system 

performance,  ,  Is  plotted 

%/M/1~2!m/m/2 

against  the  percentage  of  shared  traffic 
for  a  node  operated  under  the  PSD 
scheme.  The  utilizations  are  studied. 

The  ordinate  Is  the  percentage  of 
multiple  server  performance  that  Is 
obtained  by  the  PSD.  At  high  traffic 
utllltlzatlon  (p«0.875),  only  25  percent 
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UTILIZATION 


3. 


FIGURe  S  OCLAY-THROUOHRUT  RELATIONSHIP  FOR 
NODAL  MODEL 


3.1.2  Results  For  Other  Service 
Disciplines 

In  this  section,  we  discuss  the  nodal 
delay  for  other  service  disciplines.  For 
these  disciplines,  we  only  studied  the 
nodal  performance  for  p  ■  0.875  and  50 
percent  of  the  traffic  being' shared . 

Table  2  tabulated  the  nodal  delay  from  our 
nodal  simulation  for  the  ASD,  the  SLQD,  and 
the  F-SD.  We  can  make  the  following 
observations  from  Table  2: 

1.  A  general  characteristic  of  the 
results  In  Table  2  Is  that;  at 
high  utilizations  (p»0.875  In  this 
case)  with  a  high  percentage  of 
the  traffic  shared  (50  percent), 
the  nodal  delay  Is  closer  to  that 
of  the  M/M/2  than  the  M/M/1 
queueing  system  regardless  of  the 
service  disciplines. 

2.  The  PSD  has  the  best  nodal  delay 
among  all  service  disciplines. 


The  nodal  delay  of  the  3LQD,  with 
the  modification  of  not  serving 
(whenever  possible)  two  packets  In 
a  row  from  the  shared  queue,  is  the 
closest  to  that  of  the  PSD. 

The  ASD  with  parameter  x»l  and  the 
threshold  at  the  shared  queue  set 
to  zero,  has  almost  the  worst  nodal 
delay  performance. 

5.  The  RSD,  being  that  it  Is  an 

"uncontrolled"  service  discipline, 
does  perform  the  worst  among  all 
•ervlce  disciplines.  However,  the 
PRSD  performs  better  than  the 
N/M/1  queueing  system  by  almost 
<•0  percent. 

Table  2  Rodal  Simulation  Results 
2*1 .75  peckets/sec, 

lj“3.5  packets/sec,  w«l»  packets /sec. 

P-0.875) 


Service  Discipline 

lodal  Diclay 
(seconds ) 

PSD 

1.03 

SLqD 

Serve  the  Longer 
Queue  but  not  jr 
consecutive  "shau-ed 
queue"  packets 

- 

y  -  1 

l.OL 

y  ■  2 

1.09 

y  »  3 

i.oe 

y  -  - 

1.13 

ASD 

Alternate  Service: 
one  from  shared 
queue  and  x 
from  dedicated 
queue  but  do  not 
serve  shared 
queue  when  Its 
queue  length  ^  TH 

TH  -  0 

X  «  1 

1.11 

X  ■  2 

1.12 

X  -  3 

1-.05 

If 

s 

X  »  1 

1.09 

^  X  ■  2 

1.C5 

x  -  3 

1.07 

r 

TH  •  2 

X  »  1 

1755 

X  »  2 

1.05 

X  «  3 

l.OU 

TH  ■  3 

X  «  1 

1.06 

X  -  2 

1.07 

X  «  3 

1.05 

RSD  Random 

— r:i3 — 

M/M/Z 

1.01 

M/M/1 

2.00 

3.2  Network  Delay  Performance 
3.2.1  Analysis 

In  this  section,  we  study  the  average 
network  delay  performance.  In  our 
routing  strategy.  Instead  of  treating 
each  communications  line  separately,  we 
focus  our  Interest  on  the  whole  node. 
Including  all  its  communications  links. 
As  a  result,  our  focus  Is  on  nodal  delay 
as  opposed  to  link  delay  which  Is 
usually  studied. 


Let  us  assume  that  we  have  an  . rpresslon 
for  nodal  delay  at  node  J,  which  is 
denoted  by  Tj^.  The  average  number  of 
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packets  Nj^,  at  nodal  .1  can 


thi-r-iM'ore  be 


written  as  the 

total  traffic  ( fr.im  both  oxLernai  and 
Internal  traffic  aources)  at  node  J.  The 
average  number  of  paoKets  in  the  network 
(N)  Is  therefore  the  sum  of  the  average 
number  of  packets  at  each  node.  That  is, 
N  ■  t  N  average  network 

J.-  NODES  >1- 


delay  (N)  Is  therefore 


^  »  NODES  i 

Y 

where  y  ■  E  Is  the  total  external 

i,k 

network  traffic  demand  and  Is  the 

offered  rate  of  traffic  between  node  pair 
(J,k).  Equation  (1)  Is  therefore  a 
general  expression  for  the  average  network 
delay. 


To  analyze  the  network  delay  we  make  the 
following  assumptions.  We  assume  the  node 
models  used  previously.  The  traffic  Is 
found  by  applying  the  routing  Implied  by 
the  preassigned  paths.  We  can  then 
calculate  the  rate  In  packets  per  second 
of  arrivals  to  the  shared  and  dedicated 
queues.  In  our  network  analysis  we 
assume  these  arrivals  to  be  a  Poisson 
process  and  compute  the  nodal  delay  far 
each  node  using  the  techniques  described 
above.  Equation  (1)  gives  us  the  network 
delay.  Below  we  compare  this  technique 
with  the  results  of  two  simulations.  The 
first  dispenses  with  the  Poisson  assump¬ 
tion  and  accurately  models  arrivals  to 
Intermediate  nodes  in  paths.  To  make 
the  simulation  simpler,  packet  lengths 
are  Independently  reassigned  at  each  node 
In  a  path.  The  second  simulation 
dispenses  with  this  approximation  and 
retains  the  initial  packet  length  for  a 
throughput  Its  path. 

3.2.2  Simulation 

In  this  section,  we  will  focus  our  discus¬ 
sion  on  two  PL/1  simulations  which  were 
written  to  evaluate  the  network  delay 
performance.  The  interarrlval  time  of 
arrivals  to  network  ndoes  Is  assumed  to 
be  exponentially  distributed.  The 
packet  length  Is  assumed  to  be 
exponentially  distributed.  All  servers 
are  assumed  to  have  the  same  service 
capacity  (for  convenience  as  unity). 
Therefore,  the  average  service  rate  of 
each  server  Is  the  same.  In  most  of  our 
simulation  studies,  we  have  used 
P”0.875,  50  percent  shared  traffic  at 
the  ndoes,  and  a  symmetrical  network. 

Other  traffic  utilizations,  nodal  traffic 
nixes,  and  asymmetric  networ-k  traffic 


demands  are  also  studied  but  not  pz 
extensively  as  the  above. 

The  first  simulation  allow.-,  the  p.icke- 
lengths  to  be  reassigned  at  the  next, 
node.  It  does  not  make  any  assumpt  .V  n  ■. 
abcut  the  output  process  from  a  server. 

A  server  serves  Its  competing  queues 
according  to  a  pre-speclfled  service 
discipline.  After  the  server  flnlr.hc.s 
service  of  a  packet,  the  packet  either 
exits  from  the  network  or  Joins  the  next 
queue  for  more  service.  Once  It  arrive., 
at  the  next  queue  (In  the  next  node),  a 
new  packet  length  from  the  same  expo¬ 
nential  distribution  Is  assigned  to  It. 

It  then  Is  put  on  the  appropriate  :(ueue 
and  waits  for  service.  We  compare  the 
results  of  our  network  simulation  with 
our  analysis  based  upon  nodal  simulation 
results.  In  the  latter,  we  assume  all 
Inputs  to  the  node  to  be  Poisson. 

The  second  simulator  simulates  the  true 
service  situation  In  an  operating 
packet  network.  That  is,  a  particular 
packet  when  served  by  the  next  server 
will  require  the  same  amount  of  service 
time.  When  the  results  obtained  from 
the  second  simulation  are  compared  wlt.h 
our  analysis  by  using  cur  nodal  slmulatl!.:. 
results,  we  can  evaluate  the  effect  cf 
both  the  Poisson  output  stream  and  the 
packet  length  reassignment  assumptions. 

3. 2. 2. 3  The  First  Network  Simulation 

3. 2. 2. 1.1  Local  Service  Disciplines 

All  four  service  disciplines  are  studied. 
The  simulation  results  in  Tabxe  3  and  In 
previous  work  (Reference  [5])  reveals 
the  following: 

1.  The  average  network  delay  from  the 
network  simulation  Is  significantly 
closer  to  that  of  the  .V/M/2  nodal 
than  to  the  M/M/1  nodal  model. 

2.  The  difference  between  the  analyt¬ 
ical  results  using  the  nodal  model 
and  those  obtained  from  the  network 
simulation,  for  the  PSD,  increases 
as  the  percentage  of  shared  traffic 
increases  at  a  node. 

3.  With  the  same  traffic  mix  at  a  ncdo 
the  difference  between  the  networ-: 
delay  found  analytically  and  those 
obtained  from  the  r.etv;ork  slmulilLn 
for  the  PSD  Increases  as  the  traffic 
load  increases. 

The  network  delay  obtained  from  the 
network  simulation  for  the  Altertiato 
Service  Discipline  (ASD)  and  the 
Serve  the  Longer  Queue  Dlscli'llne 
(SLQD)  are  lower  than  that  of  Kr- 
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PSD  scheme.  The  difference  In  these 
delay  are  at  most  11  percent.  The 
network  delay  of  the  Random  Service 
Discipline  (RSD)  scheme  Is 
approximately  the  same  as  that  of 
the  PSD  scheme. 

5.  The  difference  between  the  results 
obtained  analytically  and  from 
our  network  simulation  for  other 
service  disciplines  is  sub¬ 
stantially  large  (12  to  26  per¬ 
cent  difference). 

6.  The  parameter  n  (the  number  of 
packets  to  be  served  from  the 
dedicated  queue  before  the  server 
alternate  its  service)  in  the  ASD 
has  an  effect  on  network  delay. 

7.  In  the  ASD,  the  threshold  para¬ 
meter  set  at  the  shared  queue  to 
avoid  the  shared  queue  becoming 
empty  before  the  dedicated  queues, 
also  has  an  effect  on  network 
delay. 

8.  The  network  delay  of  the  RSD  -  an 
"uncontrolled"  and  "unpredictable" 
service  discipline  -  is  about  the 
same  as  that  of  the  PSD  and  the 
"worst  cases"  of  the  ASD  and  the 
SLQD. 

Although  we  are  operating  at  high  utili¬ 
zation,  we  are  getting  most,  but  not  all, 
of  the  expected  Improvement.  Many 
different  nodal  strategies  have  only  a 
modest  effect  on  resolving  this  gap. 

We  find  that  is  was  easy  to  generate 
strategies  that  produced  multiple 
server  behavior.  But  some  of  them  have 
a  profound  effect  on  the  output  process 
from  the  server,  and  thus  affect  the 
Poisson  assumption  for  inputs  to  the 
next  node.  The  PSD  is  notorious  In  this 
regard.  Usually  any  traffic  one  hop 
away  from  Its  destination  will  be  placed 
In  a  dedicated  queue.  Thus,  only 
continuing  traffic  will  be  found  In 
nondedlcated  queues.  These  have  lower 
priority  and  will  be  transmitted  only 
when  the  dedicated  queue  Is  empty.  But 
then  a  burst  of  several  packets  Is  likely 
to  be  sent.  A  bursty  arrival  process 
results  In  much  poorer  queue  performance 
than  a  Poisson  process.  This  contributes 
significantly  to  the  gap  In  delay.  The 
design  of  a  nodal  strategy  must  consider 
not  only  multlserver  performance  at  a 
node,  but  the  resultant  output  process 
as  well.  The  usual  way  such  a  burst 
could  occur  Is  when  the  packets  are 
short.  But  these  should  have  made  the 
next  node's  Job  easier.  Unfortunately, 
the  second  aspect  of  the  Independence 
assumption  now  comes  Into  play  -  packet 
lengths  are  reassigned  at  each  node. 


This  accentuates  the  effect  of  burst  Incas. 
When  the  simulation  Is  altered  to  take 
into  account  dependence  between  nodes, 
hence  dispensing  with  the  Independence 
assumption  (as  described  In  the  next 
section),  the  difficulty  disappeared. 

3 • 2 . 2 . 1 . 2  A  Look  Ahead  Service  Scheme 

A  common  situation  of  all  the  "local" 
service  disciplines  (The  PSD,  the  ASD, 
the  SLQD,  and  the  RSD)  Is  that  a  packet 
may  be  waiting  to  be  served  In  the  shared 
queue  at  one  node  while  there  is  an  Idle 
server  at  the  next  node.  Conversely,  a 
packet  may  wait  In  the  dedicated  queue 
for  service  when  there  Is  no  need  to 
serve  a  shared  queue  packet  because  the 
next  server  has  plenty  of  packets  to  serve 
at  the  next  node.  Both  phenomena  cause 
degradation  in  network  performance.  Both 
are  due  to  the  fact  that  each  server,  upon 
completing  a  service,  does  not  know  what 
the  next  server  wants.  Consequently,  the 
server  may  serve  a  "wrong"  packet  from 
time  to  time.  Therefore,  It  Is  not  the 
selection  of  a  queue  but  the  timing  of 
this  selection  process  which  Influences 
the  service  decision.  In  all  of  the 
"local"  service  disciplines  which  we  have 
discussed  so  far,  the  timing  factor  was 
not  considered.  We  have  designed  a  look 
ahead  service  discipline  and  have 
examined  Its  performance.  The  basic 
algorithm  Is  shown  In  Figure  6. 
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FIGURE  6  A  LOOK  AHEAD  SERVICE  SCHEME 
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Table  3  Network  Simulation  Results 

^^11*^12"^  packet/soc,  packets/sec,  packo ‘.s/sec ,  p  =  0.87‘j) 


Service  Discipline 

Network  Delay  (seconds) 

i  Difference* 

Nodal  Model 

Slmulat ion 

PSD 

2.06 

2.7s 

33  J 

ASD 

Alternate  Service: 
one  from  shared 
queue  and  n 
from  dedicated 
queue  but  do 
not  serve 
shared  queue 
when  Its  queue 
length  £  TH 

TH  -  0 

n  -  1 

2.22 

2.65 

ion 

n  ■  2 

2725 

2.51 

I2i 

n  -  3 

2.10 

2.58 

23X 

TH  -  1 

n  «  1 

2.18 

2.50 

15X 

n  -  2 

2.10 

2.50 

1?X 

n  -  3 

2.111 

2.53 

i8i( 

TH  -  2 

n  -  1 

2.16 

2.5? 

20J 

n  -  2 

2.10 

i.52 

0 

n  -  1 

2.08 

2.62 

26i 

TH  -  3 

1 

c 

2.12 

2.51 

18X 

n  -  i 

2.14 

2.40 

17% 

n  -  3 

2.10 

2.55 

21% 

SLQD 

Serve  the  Longer 
queue  but  not  y 
consecutive 
shared  queue 
packets 

y  -  1 

2.08 

2.45 

1B% 

V  -  2 

ItlB 

2.56 

lOX 

y  -  3 

2.16 

2.67 

24S 

y  -  • 

ms 

07 

1$< 

RSD 

2.26 

2.71 

20X 

M/M/2 

2.02 

M/M/1 

Coo 

- 

- 

%  Difference 


^WETWOHK  (Simulation)  -  "^NETWORK  (Nodal  Model) 
'^NETWORK  (Nodal  Model) 


Table  4  show  our  results.  We  make  the 
following  observations : 

1.  The  network  delay  of  the  best 
look  ahead  scheme  has  an  Improve¬ 
ment  of  about  11  percent  over  that 
of  the  beat  local  scheme. 

2.  The  look  ahead  service  scheme  per¬ 
forms  consistently  better  than  the 
local  service  disciplines. 

3.  When  both  servers  are  empty,  It  Is 
better  to  serve  an  arrival  to  the 
shared  queue  on  the  basis  of 
downstream  information,  than  to  do 
It  randomly. 

3.2. 2.2  The  Second  Network  Simulation  - 
No  Packet  Length  ReasslgnmenF 

The  analytically  obtained  network  delay 
and  our  first  simulation  differ  not 
only  In  the  PSD  but  also  In  the  ASD  and 
the  SLQD  where  the  output  stream  should 
be  somewhat  smoother.  Furthermore,  as 
was  demonstrated  by  Klelnrock  (Reference 
[5]),  the  packet  length  reassignment 
assumption  does  not  work  well  in  a  small 
network  like  our  four  node  network  with 
very  few  traffic  streams  flowing  at  each 


node.  This  motivated  us  to  build  the 
second  network  simulator  whereby  the 
packet  length  reassignment  assumption 
is  not  made. 

3. 2. 2. 2.1  Dynamic  Routing  Scheme  With 

Local  Node  Service  Disciplines 

We  have  used  our  second  simulation  to 
evaluate  the  network  delay  performance 
under  the  various  service  disciplines 
(PSD,  ASD,  SLQD,  and  RSD)  discussed 
previously.  Each  node,  operates 
Independently,  serves  Its  queues  accord¬ 
ing  to  the  pre-speclfled  service 
discipline.  Traffic  Information  at  each 
node  is  not  exchanged  between  adjacent 
nodes.  Therefore,  only  "local"  nod- 
service  disciplines  are  discussed  In  this 
section.  We  have  also  examined  the  aver¬ 
age  network  delay  under  two  fixed  routing 
strategies  (Reference  [5]). 

Table  5  tabulates  our  results.  We  make 
the  following  observations; 

1.  Removing  the  packet  length  assump¬ 
tion  greatly  Improves  the  accuracy 
of  our  analytic  nodal  method. 
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Table**  Network  Slmulnt  1  or'  ulto  0!'  A  Look  Ahcail  ^tralogy 

0  pac  ke  t ^  ec  ,  packo-Ls/aec  ,  paokeij/:;oc,  p  =  0.^’'  ) 


Threshold 

Network  Delay  (seconds) 

*2  °  ^21* 

^2  '  *^21** 

*2  °  ®21  2  23** 

0  (PSD) 

2.75 

2.75 

1 

2.33 

2.26 

2.21 

2 

2.35 

2.29 

2.19 

3 

2.i7 

2.32 

2.26 

S 

2.  Vj 

2.^2 

2.32 

5 

- 

2.35 

- 

- 

2.41 

•  When  both  and  S^2  busy  and  there  Is  an  arrival  to 

packet  will  be  served  by  either  or  Sj^2* 


••  Under  the  same  situation  as  above,  will  be  compared  with  (the  same- 
parameter  at  node  **). 

The  packet  In  will  be  served  by  (of  If 

X2  <  Xj,  (or  X2  >  Xjj ) 


2.  The  packet  length  reassignment 
assumption  has  more  effect  on  the 
network  delay  performance  for  the 
PSD  than  any  other  service 
disciplines. 

3.  The  PSD  Is  the  best  "local"  service 
disciplines  among  all  other  local 
service  disciplines. 

*» .  The  network  delay  of  our  dynamic 
routing  scheme  under  any  one  of 
the  service  disciplines  Is 
significantly  better  than  either 
of  the  benchmark  fixed  routing 
strategies . 

3. 2. 2. 2. 2  Dynamic  Routing  With  Look 
Ahead  Service  Scheme 

The  "clock  driven"  network  simulation 
keeps  track  of  how  much  time  a  packet 
requires  from  a  server.  This  enables 
a  server  to  know  how  much  work  (In  terms 
of  service  time)  the  next  server  has  at 
any  given  Instant  of  time.  A  server  can 
make  an  accurate  decision  to  keep  the 
network  from  losing  Its  service  capacity. 
A  server,  upon  completing  a  service, 
compares  the  total  amount  of  work  at  the 
next  server  (the  remaining  service  time 
for  the  packet  still  In  service  at  the 
next  server,  the  total  service  time  in 
the  dedicated  queue  at  the  next  node, 
the  total  service  time  In  the  shared 
queue  at  the  next  node,  and  a  threshold 
parameter  which  Incorporates  the  fact 


that  the  shared  queue  packets  are  served 
by  both  servers  at  the  next  node  and  there 
are  new  arrivals  to  the  next  node  during 
the  service  time)  to  the  sum  of  th'.'  scnv;:-- 
times  of  the  first  packet  in  each  ?f  Its 
competing  (shared  and  dedicated)  jueues. 
The  server  serves  the  shared  queue  packet 
If  the  total  amount  of  work  at  the  next 
server  Is  less  than  the  sum  of  the  service 
times  of  the  first  packet  in  each  of  Its 
competing  queue.  Should  the  server  not 
serve  the  shared  queue  packet  irrunedlately , 
the  next  server  could  be  Idle  for  a  period 
of  time  equal  to  the  difference  between 
these  two  quantities.  The  server  would 
otherwise  serve  the  dedicated  queue 
packet.  This  look  ahead  service  scheme 
would  be  Ideal  In  our  dynamic  routing  if 
the  threshold  parameter  could  be 
correctly  set.  The  network  delay  with 
both  exponentially  diotrltutea  arui 
constant  packet  lengths  are  evaiuatei. 

The  following  observations  can  be  made 
(see  Table  6): 

1.  The  PSD  is  equivalent  to  setting 
the  threshold  value  to  negative 
infinity. 

2.  The  network  delay  is  1  r.senr  i  t  i  ve 
to  the  threshold  values  near  its 
"optimal"  setting. 

3.  The  difference  between  the  network 

delay  of  the  "test"  local  service 
discipline  (the  P.'^D)  nil  ’.he  look 
ahead  service  scheme  wltr.  the  " 


choice  of  the  threshold  value  Is 
about  10  percent. 

Table  5  Network  Simulation  Results  Kor  Different  Service  Disciplines  - 
Local  Decision 

P^’^*^®^/3econd ,  Xj^2"3.5  packets/second,  packets/secona 

P«0.875) 


Service 

Dlsclnllne 

Network  Delay  (seconds)  1 

First 

Simulation 

Second 

Simulation 

Analvt Ic 

PSD 

2.75 

2.15 

ASD 

2.65 

2.38 

2.22 

SLQD 

2.32 

2.08 

RSD 

2.73 

2.56 

2.26 

Fixed  Routing 
#1 

3.8i< 

— 

Fixed  Routing 

#2 

3.26 

-- 

Table  6  Network  Simulation  Results  For  A  Look  Ahead  Routing  Strategy 

(Xii»Xi2“0  packet/second ,  packets/second,  packets/ 

second,  p=0.875 


Routing  Strategy 

Network  Delay  (seconds) 

Exponential  Packet 
Lengths 

Constant  Packet 
Lengths 

Local 

PSD 

2.16 

1.23 

RSD 

2.56 

1.31 

Look 

Ahead 

TH 

-0.25 

1.96 

1.17 

0.00 

1.9« 

l.lU 

0.05 

irt? 

0.10 

1.9ii 

•  • 

0.15 

1.95 

0.25 

1796 

1.20 

0.50 

2.01 

1.27 

0.75 

2.07 

1.33 

1.00 

2.13 

1.38 

3.2. 2.2.3  Asymmetric  Traffic  Demands  And 
Service  Rates 

We  have  Illustrated  significant  Improve¬ 
ments  In  network  delay  using  our  adaptive 
techniques  for  the  four  node  network. 
However,  only  symmetric  traffic  loads 
have  been  studied  so  far.  In  order  to 
demonstrate  the  robustness  of  our 
routing  strategy,  a  certain  degree  of 
asymmetry  was  introduced  Into  the  network. 
We  have  evaluated  tne  network  delay  of 
the  four  node  symmetric  network  with 
asymmetric  traffic  and  with  different 
link  capacities. 

We  make  the  following  observations  from 
the  results  of  Tables  7  and  8; 

The  RSD  performs  the  worst  among  all 
service  disciplines;  the  PSD  performs 
the  best  among  all  routing  strategies 


based  on  local  traffic  Information;  the 
"best"  look  ahead  scheme  performs 
better  than  the  PSD  In  terms  of  aver¬ 
age  network  delay;  and  the  threshold 
value  set  In  the  look  ahead  scheme  Is 
Insensitive  to  the  network  delay 
performance  over  a  wide  range. 

J).  CONCLUSION 

We  have  studied  a  dynamic  routing  scheme 
for  packet  switching  networks.  Its  net¬ 
work  delay  performance  Is  ourerl  r  t 
that  of  fixed  routing.  We  have  also 
developed  and  evaluated  an  analytic 
technique  for  evaluating  adaptive  routlt.g 
performance.  Extensive  simulations  con¬ 
firm  these  results.  A  locally  adaptive 
scheme  performs  almost  as  well  as  one 
that  has  more  Information. 


Table  7  Network  Delay  With  Asymmetric  TrafTlc  Dvnar.do 

packets/sec,  packet/sec,  packu  Ls/sec , 

packets/sec,  p»0,875) 


Table  6  Network  Delay  With  Asymmetric  Service  Rates 

(lll*Xl2*0  packets/sec,  Xj^2*3.5  packets/sec,  ^ 

packets/sec,  Wi2“‘'2l“''32”*‘4l"^  packets/sec,  p-O.875 
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RESEARCH  SUMMARIES 


A.  Studies  in  Adaptive  Routing 
A.l.  Stable  Routing  Patterns 

As  part  of  a  general  investigation  in  the  area  of  dynamic  routing 
in  computer  communication  networks  we  consider  the  problem  of  finding 
stable  global  routing  patterns.  Specifically,  we  have  proposed  [1]  a 
two-level  routing  procedure  where  the  lower  (local)  level  adapts  dy¬ 
namically  to  instantaneous  variations  in  the  congestion  of  the  network  in 
the  immediate  vicinity  of  each  node  and  the  higher  (global)  level  en¬ 
sures  stability  by  keeping  the  average  load  across  the  entire  network  in 
some  sense  globally  balanced.  We  now  consider  this  latter  problem. 

We  consider  as  a  measure  of  global  balance  the  utilization  of  the 
most  heavily  utilized  network  element  (node  or  link)  and  seek  to  mini¬ 
mize  this  quantity.  For  simplicity,  ws  will  speak  only  of  link  utiliza¬ 
tions.  (Node  utilizations  can  be  included  in  a  straightforward  manner.) 
Thus,  we  are  given  a  network  containing  N  nodes  and  L  (directed) 
links.  Each  link,  1,  has  a  capacity  Cj.  There  are  (directed)  require¬ 
ments  r.|  between  nodes  i  and  j.  Each  r^^  is  satisfied  by  routing  it  on 
one  or  more  paths  /j  from  i  to  j.  (In  the  two  level  adaptive  routing 
scheme,  these  paths  (or  links  within  them)  will  be  the  alternatives  open 
to  each  requirement.)  A  ro  iting  pattern  is  defined  by  the  paths 
and  the  fraction,  of  r..  using  each  The  utilization  of  each 

link  is  equal  to  the  total  flow  (sum  of  fractions  of  requirements)  on  the 
link  divided  by  its  capacity. 

The  maximally  utilized  link  is  in  a  sense  the  most  vulnerable  part 
of  the  network  and  the  most  likely  cause  for  the  dynamic  routing  mech¬ 
anism  to  break  down  (e.g.  loop)  due  to  congestion.  By  minimizing  the 
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utilization  of  the  maximally  utilized  link  we  seek  to  minimize  the  chance 
of  congestion  leading  to  such  failure.  It  should  be  noted  ihat  we  are 
dealing  with  the  global  level  of  the  routing  procedure  here  and  as  such 
consider  only  long  term  average  utilizations,  not  instantaneous  meas¬ 
ures.  The  local  level  of  the  routing  procedure  concerns  itself  with 
making  decisions  instantaneously  on  the  basis  of  the  local  stale  of  the 
network  in  the  vicinity  of  a  node.  Even  within  the  constraints  of  a 
given  P^^^^and  f^y^,  defined  by  the  global  strategy,  the  local  level  has 
considerable  flexibility  in  choosing  when  to  use  each  route  and  as  such 
can  obtain  substantial  reductions  in  delay  when  compared  with  static 
routing  policies. 

We  now  turn  to  the  problem  of  actually  finding  the  optimal  global 
routing  pattern  as  defined  above.  The  technique  resembles  the  Flow 
Deviation  Method  of  Cantor  and  Gerla  [2]  and  will  bo  described  in 
similar  terms.  Cantor  and  Gerla  sought  to  minimize  the  average  delay 
whereas  we  seek  to  minimize  the  maximum  utilization  of  a  link.  Both 
functions  are  convex  functions  over  a  convex  region  and  as  such,  the 
same  type  of  procedure  can  b.e  proven  to  yield  an  optimal  routing 
pattern.  The  proof  is  given  in  [2]. 

Our  function  is  only  piecewise  differentiable  and  as  such,  the 
gradient  search  used  in  [2]  is  not  appropriate.  In  fact,  the  alternative 
described  here  takes  the  special  nature  of  the  objective  function,  a 
minimum  of  linear  functions,  into  account  and  not  only  overcomes  its 
non-differentiability  but  also  is  considerably  more  efficient  and  easier 
to  implement  than  a  gradient  search. 

We  now  give  an  outline  of  the  optimization  procedure.  A  high 
level  flowchart  of  this  procedure  is  given  in  Figure  1.  As  mentioned 


above,  at  this  level  the  procedure  is  almost  identical  to  the  Flow  De¬ 
viation  Algorithm.  The  key  difference,  which  is  only  evident  in  a  more 
detailed  description,  is  how  the  optimal  superposition  of  flows  is  found. 

We  define  the  length  of  a  link  to  be  its  utilization.  Initially,  we 
set  all  link  lengths  to  0.  We  define  the  length  of  a  path  to  be  the 
length  of  the  longest  link  in  the  path  plus  a  small  constant  times  the 
number  of  links  in  the  path.  This  latter  term  is  added  to  break  ties 
among  paths  with  equally  utilized  links  in  favor  of  a  path  with  the 
smallest  number  of  links.  Note  that  this  definition  of  path  length  is 
different  from  the  conventional  one  but  it  serves  our  purpose.  Short¬ 
est  paths  using  this  metric  are  computable  using  conventional  shortest 
path  algorithms. 

The  routing  pattern  found  at  each  stage  in  the  optimization  proce¬ 
dure  is  a  single  shortest  path  for  each  requirement  r...  (In  general, 
this  path  is  not  unique,  but  this  poses  no  problem.)  A  flow  pattern  is 
defined  as  the  total  flow  in  each  link  and  is  found  by  loading  the 
requirements  onto  the  links  specified  in  the  current  flow  patterns. 

An  optimal  superposition  of  the  current  flow  pattern  with  all  pre¬ 
vious  flow  patterns  is  then  found.  This  is  the  key  step  in  the  proce¬ 
dure  and  is  done  by  finding  the  value  of  A  between  0  and  1  which 
minimizes  V,  the  maximum  link  utilization,  where  \  represents  the  frac¬ 
tion  of  all  previous  flow  patterns  used.  The  new  optimal  superposition 
of  flows  is  then  A  times  the  previous  superposition  plus  (1-A)  times  the 
current  flow  pattern.  A  new  superposition,  and  hence  link  utilizations, 
is  then  obtained.  This  in  turn  yields  a  new  value  for  V(K)  and  the 
link  lengths.  We  can  now  start  another  iteration.  If,  however,  no 
improvement  in  V(K)  has  been  observed,  the  iteration  has  converged 


and  we  terminate  the  procedure  with  an  optimal  flow  pattern.  By 
saving  the  routing  patterns  and  ,  the  values  of  \  for  each  K,  the 
optimal  routing  pattern  can  be  obtained.  In  particular,  if  is  the 
(i,j)  path  first  used  in  the  routing  pattern  in  iteration  K  then  the 
fraction  of  commodity  (i,j)  using  P^^^^is 

j=K+l 

where  M  is  the  number  of  iterations  and  the  product  is  defined  equal  to 
1  for  K  =  M. 

We  now  turn  to  the  problem  of  how  to  find  the  optimal  super¬ 
position  of  flows.  Consider  two  flow  patterns,  and  Each 

flow  pattern  assigns  a  flow  to  each  link.  Thus  and  f^^.^are  the 
flows  assigned  to  link  (i,j)  in  and  F^^\  respectively.  For  any  A 

between  0  and  1,  the  flow  assigned  to  link  (i,j)  by  superposing  AF^^^ 
and  (l-A)F^^^  is  then 

(l-A)f^f.^ 

which  equals 

Mf(».  f(2), 

which  is  a  linear  function  of  A.  Dividing  by  Cj,  to  obtain  utilizations, 
there  will  be,  in  general,  a  different  function,  aj  +  bjA  for  each  link  1. 
(We  will  for  simplicity  refer  to  links  by  a  single  index,  1,  rather  than 
endpoints  (i,j).) 

We  seek  the  value  of  A  which  minimizes  the  maximum  of  these 
functions  over  all  1.  Several  simple  observations  allow  us  to  find  this 
value  of  A  in  an  efficient  and  straightforward  manner.  First,  if  for  two 
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links,  1  and  m,  a,  >  a  and  b,  >  b  then  link  1  is  said  to  dominate  link 
m  and  link  m  can  be  ignored  as  it  clearly  does  not  participate  in  the 
maximum  since  a,  +  b,\  >  a  +  b  A  for  all  values  of  A.  Indeed,  if  a,  + 

biA  >  +  b^A  for  all  A  between  0  and  1  then  link  m  may  be  ignored. 

Note  that  this  latter  condition  is  not  equivalent  to  the  former,  for 
example  if  aj  =  10,  bj  =  0,  =  2,  and  b^  =  3. 

We  can  then  arrange  the  links  1  in  descending  order  of  aj  and 

examine  the  b, .  Any  link  m  for  which  b„  does  not  exceed  b,,  where  1 

1  ^  m  r 

is  the  predecessor  of  m  in  the  order,  can  be  eliminated.  We  now  have 
an  ordering  of  links  which  is  descending  in  aj  and  ascending  in  bj.  We 
then  compute,  for  each  adjacent  pair  of  links  1  and  m,  the  value  Aj  for 
which  aj  +  bjAj  =  The  Aj  should  form  an  ascending  se¬ 
quence.  A  value  of  Aj^  which  is  less  than  the  value  of  its  predecessor 

in  the  sequence  corresponds  to  a  link  1  which  can  be  eliminated  from 
further  consideration.  In  this  case,  link  1  is  eliminated  and  m  has  a 
new  predecessor.  The  Ap  is  then  recomputed  for  link  p  the  new  pre¬ 
decessor  of  m  and  this  process  is  repeated.  For  link  n,  the  last  link 
in  the  sequence,  A^  =  1. 

We  now  compute  for  each  remaining  link  1  Vj  =  aj  +  bjAj  and  select 

the  minimum  of  these  values.  The  resulting  Aj  and  Vj  are  the  desired 

values  yielding  the  optimal  superposition. 

This  entire  process  is  illustrated  in  Figure  2.  The  links  have 
been  sorted  so  that  the  aj  are  descending.  Links  with  nonascending  bj 
have  already  been  eliminated.  Thus  the  bj  form  an  ascending  sequence. 
This  is  evident  in  Figure  2  by  the  fact  that  the  lines  form  a  sequence 
increasing  in  slope.  The  intersection  of  lines  1  and  2  (i.e.  the  lines 
starting  at  aj^  and  32)  defines  Aj.  Similarly,  the  intersection  of  lines 


2  and  3  defines  K2  and  ^2  ^  far  no  line  is  dominated.  The 

intersection  of  lines  3  and  4  takes  place  between  \  =  1  so  line  4  is 
dominated  by  line  3  and  line  4  is  thus  eliminated  from  further  consider¬ 
ation.  The  intersection  of  lines  3  and  5  defines  a  value  of  (dotted 
line),  but  when  is  computed  we  find  it  to  be  less  than  A^.  So,  line 
5  is  dominated  by  line  6  and  removed  from  further  consideration.  A^  is 
then  recomputed  from  the  intersection  of  lines  3  and  6.  Ag  is  computed 
from  the  intersection  of  lines  6  and  7.  Finally  A^  =  1.  This  leaves  us 
with  A^,  A2,  A^,  Ag,  and  (also  Ag  =  0).  We  search  among  the  cor¬ 
responding  Vj  and  find  v^  is  minimum.  It  and  A^  define  the  desired 
superposition. 

The  entire  optimization  process  is  illustrated  in  Figures  3,  4,  and 
5.  The  network  consisting  of  3  nodes,  6  links  and  6  requirements  is 
shown  in  Figure  3.  For  simplicity,  we  assume  symmetric  requirements 
and  link  capacities.  We  can  thus  assume  a  symmetric  solution,  i.e., 
routes  for  r-  the  reverse  of  routes  for  rjj  and  equal  utilization  of  each 
link  in  both  directions.  This  allows  us  to  only  consider  3  links  and  3 
requirements  in  the  example.  This  is  done  to  simplify  the  example. 
The  actual  procedure  works  with  directed  links  and  requirements.  It 
can  also  be  used  with  undirected  links  and  requirements  but  such  a 
situation  is  rarely  physically  meaningful. 

Initially,  all  requirements  are  routed  directly  since  the  initial 
shortest  paths  by  our  definition  would  be  the  paths  with  the  minimum 
number  of  links.  This  is  illustrated  in  Figure  4a.  The  link  lengths 
are  then  recomputed  --  link  1  has  a  utilization  of  .3  and  hence  a  length 
of  .3,  etc.  The  shortest  paths  are  then  recomputed  and  are  shown  in 
Figure  4b.  Note  that  the  shortest  path  from  B  to  C  is  now  B-A-C. 


rhe  requirements  are  loaded  onto  these  paths.  The  flow  pattern  is 
shown  in  Figure  4c. 

Now  a  superposition  of  the  flow  patterns  in  Figures  4a  and  4c  is 
done.  Figure  5  illustrates  the  dynamics  of  this.  Note  that  r^^  domi¬ 
nates  r^g  and  that  the  optimal  A  is  .9  and  v  =  4.5.  Figure  4d  shows 
the  resultant  routing  pattern  formed  by  using  the  first  routes  for  90% 
of  the  traffic  and  the  second  routes  for  the  remaining  10%.  Figure  4e 
shows  the  flow  pattern  resulting  from  the  superposition.  Note  that  the 
maximum  utilization  is  .45  (in  links  (A,C)  and  (B,C))  which  is  less 
than  the  maximum  in  either  of  the  patterns  in  Figures  4a  and  4c. 

We  now  recompute  the  link  lengths  and  the  shortest  paths.  The 
resultant  routes  are  the  same  as  in  Figure  4a.  An  optimal  superposition 
between  this  flow  pattern  and  the  one  in  Figure  4e  is  then  done.  The 
optimal  value  of  A  is  1,  no  improvement  in  v  is  found  and  we  conclude 
that  the  routing  and  flow  pattern  in  Figures  4d  and  4e  are  optimal. 
Note  that  the  links  (A,C)  and  (B,C)  are  both  maximally  utilized.  They 
form  a  cut  which  is  analogous  to  the  saturated  cut  in  Gerla's  Cut  Satu¬ 
ration  Method.  (The  existence  of  such  a  cut  is  a  necessary  condition 
for  the  optimality  of  a  flow  pattern.) 

We  thus  have  developed  a  simple  and  efficient  algorithm  for  obtain¬ 
ing  stable  flow  patterns  for  use  globally  as  the  higher  level  in  our  2 
level  adaptive  routing  procedure.  In  the  coming  months  we  hope  to 
implement  this  procedure  and  experiment  with  it. 

In  an  allied  study  we  investigated  a  pattern  for  placing  virtual 
calls  on  a  network.  A  simulation  program  was  written  to  directly  ob¬ 
serve  the  dynamic  performance  of  an  algorithm  which  loads  calls  on 
alternate  routes  according  to  the  following  algorithm: 


1.  Load  each  incoming  call  onto  the  route  currently  carrying  the 
smallest  number  of  calls.  (The  number  of  calls  carried  by  a 
route  is  defined  for  the  purposes  of  this  algorithm  to  be  the 
number  of  calls  on  the  first  link  in  the  route.) 

2.  If  there  is  a  tie  among  several  routes  in  a  set,  S,  select 
route  i  with  probability  Pj(S). 

The  simulation  was  written  to  provide  us  with  a  first  glimpse  of 
the  dynamic  performance  of  such  a  procedure  as  a  guide  for  further 
research  in  this  area.  We  thus  wanted  to  keep  it  as  simple  as  possible 
and  considered  a  3  node  6  link  network  as  shown  in  Figure  5a  with 
symmetric  requirements.  The  program  can  easily  be  expanded  to  con¬ 
sider  more  general  cases  but  we  chose  this  simple  one  initially  in  order 
not  to  obscure  the  basic  results. 

Calls  arrive  at  each  node  at  a  rate  A.  (Poisson)  and  are  served  at 
rate  |j  (exponential)  by  the  links,  i.e.,  have  exponential  duration  with 
average  length  1/|j.  Each  call  has  a  choice  of  a  1  hop  path  or  a  2  hop 
path.  A  call  arriving  at  a  node  is  equally  likely  to  be  destined  for 
either  other  node.  Thus,  there  is  total  symmetry  in  the  system.  It 
should  be  noted  that  a  call  taking  a  2  hop  path  occupies  2  links  but 
remains  in  the  system  for  time  l/p  on  average  (not  2/\j). 

The  simulation  is  straightforward.  Call  arrivals  are  generated 
randomly  and  arriving  calls  are  routed  according  to  the  algorithm  given 
above.  The  number  of  calls  taking  the  1-hop  and  2-hop  routes  were 
recorded  for  each  run.  A  parameter,  a,  determined  the  probability  of 
taking  the  1-hop  route  when  there  was  a  tie  between  the  2  routes  (a  = 
Prob  {using  the  1-hop  route  in  case  of  a  tie}). 
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For  u  =  .5  the  traction  of  calls  taking  the  l-hop  route  was,  not 
surprisingly,  very  close  to  For  a  =  0,  however,  the  fraction  varied, 
lor  A/p  =  1,  80%  of  the  calls  took  the  2-hop  path.  For  A/p  =  10,  66%  of 
the  calls  took  the  2-hop  path.  For  A/p  =  50,  65%  of  the  calls  took  the 
2-hop  path.  For  a  =  1,  the  results  (fraction  on  1-hop  versus  fraction 
on  2-hop  paths)  reversed  relative  to  the  results  for  a  =  0. 

We  thus  conclude  that  we  have  some  control  but  not  total  control 
over  the  routing  via  a  which  only  operates  during  a  tie.  The  control 
gets  greater  for  systems  with  a  smaller  number  of  calls  in  progress,  as 
evidenced  by  the  results  for  smaller  values  of  A/p  (which  is  directly 
related  to  the  number  of  calls  in  the  system).  Observations  of  the 
number  of  calls  in  the  system  at  various  points  in  a  simulation  run  led 
to  the  conclusion  that  the  system  is  stable,  i.e.,  that  the  link  loads 
reach  a  stable  level  and  remain  close  to  that  point  and  close  to  one 
another. 
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ABSTRACT 

This  paper  considers  the  problem  of  routing  traffic  with  many  desti¬ 
nations  through  a  congested  network.  The  problem  is  formulated  as  a 
very  general  dynamic  multicommodity  flow  problem.  The  optimal  control 
problem  which  results  is  a  very  difficult  one  to  solve  analytically  (even  for 
open  loop  solutions)  because  of  the  presence  of  Doth  state  variable  con¬ 
straints  and  delays  in  the  system  dynamics. 

The  difficulties  inherent  in  the  centralized  problem  can  be  avoided  by 
using  a  decentralized  approach  to  obtain  dynamic  routing  strategies.  Some 
of  the  results  obtained  using  this  approach  are  presented. 

INTRODUCTION 

Consider  a  network  made  up  of  N  nodes  connected  by  L  directed  links 
along  which  the  rate  of  traffic  flow  can  be  controlled.  M  of  the  nodes  are 
destination  nodes  toward  which  varying  amounts  of  traffic  must  be  routed 
in  accordance  to  the  traffic  demand. 

When  the  traffic  demand  exceeds  the  capacity  of.  the  network  to  meet 
this  demand  for  service,  congestion  builds  in  the  network.  In  the  model 
used  here  (first  suggested  by  Gazis^^^)  the  congested  network  is  viewed 
as  a  store  and  forward  network  in  which  all  delays  due  to  congestion  are 
treated  by  assuming  that  queues  can  build  inside  the  network  at  the 
network  nodes.  Traffic  is  assumed  to  move  between  nodes  at  a  constant 
velocity  (this  corresponds  a  fixed  travel  time  between  nodes)  and  is  stored 
at  the  nodes  in  queues  until  it  can  proceed  onward  toward  its  destination. 

The  degree  of  congestion  is  measured  by  the  aggregate  of  all  queues 
in  the  network.  The  problem  considered  here  is  that  which  occurs  after  a 
"rush  hour"  or  period  of  heavy  demand  when  the  demand  has  fallen  below 
the  capacity  but  congestion  is  still  present. 

Using  the  store  and  forward  model  the  queue  dynamics  can  be  expressed 
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where  denotes  a  continuous  state  variable  which  approximates  the 

length  of  the  traffic  queue  at  node  n  having  node  m  as  its  destination; 

denotes  the  rate  of  traffic  flow  to  destination  m  on  link  (nk);  r^”’ 
nk  n 

is  the  traffic  demand  entering  at  n  with  destination  m  and  is  the 
delay  on  link  (j,n)  encountered  by  traffic  to  m.  For  deterministic  input 
demand,  the  objective  is  to  clear  the  congestion  (so  that  qj^”'(T)=0  for  all 
n  and  m)  at  a  specified  time  T  which  is  large  enough  to  achieve  this 
objective  while  minimizing  the  aggregate  cost  of  congestion 
T  M  N  „  N  „  „ 

*nlc  “nk  (2) 

to  m-i  n— i  K— 1 

In  this  integral  the  terms  give  the  aggregate  delay  due  directly 
to  the  congestion  while  the  flow  terms  produce  a  cost  or  aggregate  delay 
associated  with  the  link  flows.  The  constraints  to  be  met 


M 


q^'”(t)  >  0;  u^j^™(t)  >  0  and  1  ^ 

m=l 


nk 


(3) 


for  all  m,  n,  k  and  t 


impose  the  requirements  that  queue  lengths  and  flows  must  be  non*negative 

and  that  each  link  has  a  finite  capacity  C_.  . 
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D'Ans  and  Gazis^  ^  have  shown,  that  an  optimal  open  loop  control  for 
this  centralized  control  problem  can  be  computed  by  discretizing  the  time 
appropriately.  When  this  is  done,  the  problem  can  be  expressed  as  a 
linear  programming  problem  and  an  optimal  open  loop  solution  can  be 
computed  for  any  initial  conditions. 

This  problem  with  all  equal  to  zero  was  considered  by  Moss  and 
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SegalP  They  present  a  set  of  necessary  conditions  via  the  maximum 
principle  but  they  offer  a  general  procedure  for  finding  the  flows  in 
feedback  form,  only  for  single  destination  networks  with  zero  input  demand. 

In  seeking  a  closed  loop  solution  to  this  centralized  optimal  control 
problem,  one  runs  into  considerable  difficulty  because  even  necessary 
conditions  are  not  available  for  problems  having  both  state  variable  con¬ 
stants  and  time-delay  systems. 


2. 


A  DECENTRALIZED  APPROACH 


Most  of  the  difficuties  inherent  in  the  centralized  formulation  can  be 
avoided  by  adopting  a  decentralized  approach  to  this  problem.  In  general 
the  decentralized  approach  to  controlling  large  scale  systems  is  to  use  a 
number  of  low  level  controllers  operating  on  local  information  to  control 
portions  of  a  large  system.  This  local  information  is  supplemented  by  some 
interchange  of  information  between  local  controllers  or  between  local 
controllers  and  a  supervisory  controller.  How  well  the  overall  system 
operates,  depends  on  what  information  is  exchanged.  The  local  controllers 
are  designed  to  perform  their  limited  tasks  optimally.  For  the  dynamic 
routing  problem  considered  here,  the  local  controllers  are  node  level 
controllers  which  must  assign  traffic  flow  on  the  links  leaving  the  node 
based  on  the  information  available  locally. 

The  Local  Optimization  Problem 


The  optimization  problem  faced  at  a  typical  node  is  based  on  the 
queue  dynamics 


q“‘(t)  *  R^'Ct)  -  I 


where  R*"  is  the  total  rate  of  traffic  arrivals  (from  both  upstream  nodes 
and  from  outside  the  network)  which  have  the  destination  m  and  there  are 
L  links  which  exit  from  the  node.  We  wish  to  find  the  link  flows  u^‘”(t) 
as  a  function  of  the  local  state  vector  q(t),  to  minimize 


as  a  function  of  the  local  state  vector  q(t),  to  minimize 
T  M  C 

J{u}  =  /  Z  (q"'(t)  ♦  Z  S,'"uAt)]dt  (5) 

to  m=l  4=1  *  * 

where  denotes  the  cost  or  delay  per  unit  of  flow  on  link  4  to  reach 
destination  m.  The  constraints  are  as  in  (3)  and  g  (T)  =  0.  It  is  ap¬ 
parent  that  since  all  incoming  demand  has  been  consolidated  into  one  term, 
the  local  controller  is  not  attempting  to  control  a  time-delay  system.  A 
feedback  solution  can  be  thus  be  sought  using  Hamilton- Jacobi  theory. 
The  minimum  cost  due  to  congestion  can  be  expressed  as 


T  M 

V(g(tO))  =  min  {J{u}-J*)  =  min  /  Z 
u  ~  u  to  m=l 


[q"'(t)  +  Z  S,"'(u,"'(t)-u,"'*(t))dt 
4=1  ^  ^ 


where  u^i^  are  the  optimal  non-congested  flows  (i.e.  the  flows  which 

minimize  J  in  (5)  when  g  (to)  =  0)  and  J*  is  the  corresponding  minimum 

L 


non-congested  cost  on  (to,  T).  Note  that  R  (t)  = 


(t).  When  the 


3. 


are  constants,  then  the  minimum  cost  function  V(  q)  satisfies  the 
Hamilton-Jacobi  equation 

min  i  lq"'(t)  +  Z  (S -  — n,)(uAt)  -  u  ^*)]  =  0  (7) 

where  V(0)  =  0  and  the  minimization  is  subject  to  constraints  as  in  (3).  If 
the  V(  g)  which  satisfies  (7)  and  boundary  conditions  imposed  by  state 
constraints  can  be  found  then  this  would  produce  necessary  and  sufficient 
conditions  for  the  optimal  flows. 

A  complete  solution  to  the  local  optimum  control  problem  is  not  yet 

available.  A  method  for  obtaining  an  approximate  solution  was  presented 
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by  Ozgiiner  and  Sarachik^  ‘ .  However  the  exact  solution  is  known  only 
for  two  special  cases. 


Two  Special  Cases 


The  Single  Destination  Case  (M=l' 


first  solved  this  local 


dynamic  routing  problem  using  a  geometric  approach.  Later  Sarachik  and 
Ozgtiiner^^^  obtained  the  same  solution  using  the  approach  above.  Since 
(7)  must  be  minimized  at  each  time  instant,  this  is  equivalent  to  choosing 
the  Ujj(t)  to  minimize 

«£  ■  I?  >  <8> 

subject  to  the  constraint  0  <  Ujj(t)  <  C^j.  This  says  that  we  should  choose 


U,(t)  = 


Thus  if  the  links  are  numbered  according  to  the  ordering  <  S2  £ 
...  1  Sj^  and  if  K  is  the  largest  index  £  satisfying  S^j  <  (av/3q)  we  see 

that  on  links  £=1,2 .  K  the  full  capacity  should  be  used  to  carry  traffic 

flow  whereas  on  links  £=K+1,...,  L  no  traffic  should  flow.  Substituting 
this  result  in  (8)  gives 


(C.  for  q(t)  >  Y. 
^£^^W  lo  for  q(t)  < 


(10) 


where  the  thresholds  Y^  are  given  by 
Y,  =  (C,-u*)(Sj-S,) 

for  £=1, . . .L. 


(11) 


4. 


Multi-destination  with  L=2.  Sarachik^^^  obtained  a  solution  tc  (7) 
with  arbitrary  M,  for  a  node  with  only  two  exit  links.  The  method  of 
solution  involved  finding  the  solutions  along  the  state  space  axes  (these 
are  one  dimensional  problems)  and  using  the  results  as  boundary  conditions 
for  the  admissible  state  space  g  >  0.  The  derivation  can  be  found  in  (7). 
The  results  show  that  the  admissible  state  space  is  divided  into  non¬ 
intersecting  regions  by  planes.  In  each  region  the  optimal  flows  must 
satisfy  a  necessary  and  sufficient  condition.  The  regions  can  be 
found  by  simple  threshold  tests  on  aggregate  queue  lengths.  Specifically 
with  (the  additional  cost  of  using  link  2)  the  destinations 

are  ordered  according  to  p^  >  p  2 >•  •  >  0  >  p  >. . .  >  pj^  (note  that 

traffic  to  destinations  1  and  K  prefer  to  use  link  1).  Thresholds  are 
calculated  using 

X^(k)  =  X^(k+1)  +  Yj^(k)  [Pjj-Pk+il  for  k  =  K .  1  (12) 

with  X^(K+1)  ^  Pj^^^  y^(K)  and 

X2(k)  =  X2(k-1)  +  Y2(k-1)  ^  ^  . 

with  X2(K)  -  -Pj^y2(K).  The  Y|(k)  denotes  the  capacity  available  on  link  i 
for  emptying  queues  when  the  demand  r"'  for  destinations  m=l  to  k  is 
routed  on  link  1  and  the  rest  on  link  2  (note  that  for  k  ^  K  this  means 
some  traffic  will  by  diverted  away  from  its  preferred  link).  The  composite 
queue  lengths  are  next  defined  as 

Q,(k)  ^  Z  q*"  ;  Q  (k)  ^  Z  q"™  (14) 

m*l  m=k+l 

and  the  test  variable  T^(k)  =  (Y2(k)Qj(k)-Yi(k)Q2(k))/Y2(k)  is  found  for  all 
thosek  <  K  which  also  make  Y2(k)>0).  Fork  <Kif  X^(k+1)  <  Tj(k)  <X^(k)  and 
Qj^(k),  Q2(k)  >  0  the  optimum  flows  satisfy 

a)  U,  (k)  =  C, ;  u™  =  0  for  m  >  k 

m  (15) 

b)  U2(k)  =  C2;  U2'"  =  0  for  m  <  k 

where  the  composite  flows  are  defined  as 

A  m  A  M 

U,(k)  ^  Z  u,'"  ;  U,(k)  »  Z  u,"’  (16) 

m=l  ^  ^  m=k+l  ^ 

whereas  if  Tj(k-l)  <  Xj^(k)  <  T^(k)  the  optimum  flows  must  satisfy  (15a) 
and 

U2(k)+U2*^  =  C2  ;  U2'"  =  0  for  m  <  k-1.  (17) 


5. 


The  only  difference  between  (15b)  and  (17)  is  that  (17)  permits  a  non-zero 
.  For  a  state  on  the  boundary  of  the  admissible  region  (i.e.  when 

Qj(k)=0  for  some  k  or  i)  the  conditions  (15)  and  (17)  must  be  modified 

slightly.  In  this  case  the  Uj(k)  must  equal  the  corresponding  aggregate 
demand. 

If  the  above  tests  fail  then  for  k  >  K  the  test  variable  T2(k)  = 
(Yj^(k)  Q2(k)  -  Y2(k)Qj^(k))/Yj^(k)  is  formed  for  all  those  k  for  which  Yj^(k)  >  0. 
For  Qj(k),  Q2(k)  >  0  if  X2(k)  <  T2(k)  <  X2(k+1)  the  optimum  flows  must 
satisfy  (15)  whereas  if  T2(k)  <  X2(k)  <  T2(k-1)  the  optimum  flows  satisfy 
(15a)  and  (17).  It  should  be  noted  that  (15)  or  (17)  do  not  specify  the 

flows  uniquely  so  many  optimal  implementations  are  possible. 

Coordination  of  Local  Control 


In  order  for  the  node  level  controllers  to  implement  the  local  control 
strategies,  information  must  be  supplied  to  it  about  the  parameters  S.'”. 
This  can  be  done  for  each  link  exiting  the  node  by  forming  ~  + 

♦j™  where  s^*”  is  the  delay  (or  cost)  of  traversing  link  £  to  the  first 
downstream  node  j  and  ♦j™  denotes  an  estimate  of  the  delay  (or  cost)  of 
reaching  destination  m  from  node  j.  These  cost-to-go  terms  must  be 
computed  at  each  node  and  transmitted  to  the  upstream  nearest  neighbor. 
The  way  these  terms  are  calculated  greatly  affects  the  performance  of  the 
overall  network.  Chu^^^  using  a  capacity  weighted  average  for  a  single 
destination  network  (M=l)  defined 


♦  ^Ct)  = 


qj"*(t)  ♦  I  S /"c, 

J _  1=1  *  * 


t 

1*1 


and  had  to  restrict  the  application  to  non-cyclic  networks.  Whereas  if  C^ 
is  replaced  by  the  excess  capacity  -  C^-u^  the  strategy  can  be  extended  to 
networks  containing  cyclic  paths  since  it  can  be  proven  that  no  cyclic 
flows  will  occur^®^.  Sarachik^®^  using  flow  weighted  averages  (i.e. 
replace  in  (18)  by  Ujj"’)  for  the  multi-destination  case  required  the 
non-cyclic  restriction.  This  restriction  was  removed  in  [4]  by  using  a 
certain  kind  of  excess  capacity  average.  The  question  of  what  is  the  best 
coordination  scheme  is  still  open. 


CONCLUSION 


This  paper  has  described  the  general  problem  of  using  dynamic  routing 
to  clear  congestion  from  a  congested  network.  A  decentralized  strategy 
was  described  which  utilizes  local  node  level  controllers.  The  optimal  local 
strategies  were  presented  for  2  special  cases  when  traffic  demand  is  con¬ 
stant.  Research  is  being  done  to  extend  the  local  results  for  traffic 
demand  modeled  by  a  Poisson  arrival  process  and  to  obtain  the  exact 
optimum  routing  for  M  destinations  with  L  exit  links.  More  research  is 
also  needed  to  determine  whether  an  optimal  coordination  strategy  can  be 
found . 


REFERENCES 

[1]  Gazis,  D.C.,  "Modelling  and  optimal  control  of  congested  transporta¬ 
tion  systems".  Networks,  Vol.  4,  pp.  113-124,  (1974). 

[2]  G.C.  D'ans  and  D.C.  Gazis,  "Optimal  Control  of  Oversaturated  Feed  - 
Forward  Transportation  Networks,"  IBM  Research  Report,  #RC4543; 
1973. 

[3]  F.H.  Moss  and  A.  Segall,  "An  Optimal  Control  Approach  to  Dynamic 
Routing  in  Data-Communication  Networks,"  Report  EE312.  Faculty  of 
EE,  Technion;  1977  (To  appear  in  IEEE  Trans.  Aut.  Control). 

[4]  Ozgtfner,  D.  and  Sarachik,  P.E.,  "Decentralized  Routing  in  Congested 
Multi-Destination  Traffic  Networks"  Proc.  of  1981  ICCC  Conference, 
pp.  1006-1009;  1981. 

[5]  Chu,  K.C.,  "Decentralized  real-time  control  of  congested  traffic 
networks",  IBM  Research  Report,  RC-6337,  Dec.  1976. 

[6]  Sarachik,  P.E.  and  Ozgdner,  D.,  "On  decentralized  routing  in  traffic 
networks,"  Univ.  of  Toronto,  Dept,  of  Electrical  Engineering,  Systems 
Control  Report  #8003,  April  1980.  To  appear  in  IEEE  Trans.  Aut. 
Cont. 

[7]  Sarachik,  P.E.,  "Optimal  local  dynamic  routing  for  clearing  multi- 
destination  neworks,"  EE  Dept.  Report.  Polytechnic  Inst,  of  N.Y.; 
Jan.  1981. 

[8]  Sarachik,  P.E.,  "Clearing  of  congested  multi-destination  networks," 
Proc.  of  Engr.  Found.  Conf.  on  Research  Dev.  in  Comp.  Control  of 
Urban  Traffic  Syst.,  Feb.  1979. 


SIO 


tEEf  TEANSACTtONS  ON  ALfTOMATIC  CONTEOL.  VOL.  AC-27,  NO.  2.  APRIL  191(2 


‘An  Effective  Local  Dynamic  Stralecy  to  Clear 
Congested  Multidestination  Networks 

PHILIP  E  SARACHIK 

Aiurmet  — Tlw  loiiltioii  ol  ■  local  roulini  proMcm  for  a  co«R«t«4  iio4c 
wllh  l«a  aMctoalivc  m*!**  ^  eruinaiiam  of  a  iraffk  nmvork  Is 

pmciMaa.  Tkt  solmloo  shows  Ihn  Ihc  admissible  stale  sfocc  K  disMcd 
huo  a  fiaite  aumber  of  rsipoat  aiibio  which  a  differcal  set  of  oetiaml  flow 
caadMoai  aasst  be  satlsliad.  A  specific  realualioa  of  these  oeiiaMi  coadi- 
PQHS  ■  pmmiOT  wwoi  cm  m  viiimv  sy  mmh  mmv  cowfomc*  n 
daecttESEfiaad  eaahwf  strategies  far  laiasr  aetwoHLs, 

L  iHTaODUCTtON 

The  problem  of  rouliaa  traffic  through  networks  from  nodes  of  or«|in 
to  destination  nodes  is  an  important  problem  oommon  to  the  field  of 
traffic  CMitrol  and  data<ommunication  networks.  Much  of  the  early  «wrk 
dealing  with  the  traffic  assignment  problem  in  both  fields  was  restricicd 
to  finding  an  optimum  sieady-suic  routing  (sutie  sirategiesl  under  non- 
congfstrd  conditions.  In  1974  Gatis  (l|  proposed  that  a  siotc-and-fotwaid 
network  could  be  used  to  model  congested  transportation  systems.  In  this 
model  the  delay  due  to  congcsiion  it  asiociat^  with  the  waiting  tinw  on 
queues.  For  a  single  destination  node  and  using  the  aggregate  delay  in  the 
nenrork  as  the  performance  criicrion.  D'Ans  and  Gaik  (2|  obtain  an 
cpen-loop  solution  by  discrctiiiag  in  time  and  Chu  and  Gatis  |3)  obtain  a 
dyaanuc  feedback  solution  for  a  simple  network  with  two  alternate  routes. 
At  About  the  same  time,  Segall  and  Moat  (4HbI  tued  a  similar  model  to 
study  the  dynamic  routing  praMsm  in  congest^  multidestinatioa  data- 
communication  networks,  liiey  smtmprl  that  the  delays  encountered  in 
trantiiing  network  links  are  asro.  so  the  resulling  queue  dynamics  are 
governed  by  ordinary  differential  equations.  Using  the  maaimum  princi¬ 
ple.  they  obtain  a  set  of  nerrsiary  conditions  for  the  optimal  solution  and 
propose  a  oonstructivc  algariihm  for  finding  the  optimal  centralized 
eoniroL 

When  transit  delays  are  present,  the  queue  dynamics  are  governed  by 
differential  diffcienoe  equations  and  the  general  centralized  optimal  dy¬ 
namic  touting  problem  becomes  formidable,  since  state  variable  con- 
strainis  are  abo  present.  Moat  of  the  difficnllics  inherent  in  the  central¬ 
ized  problem  ean  be  avoided  by  tacking  a  decentralized  oontiol  sohitkm. 

The  decentralized  approach  to  clearing  congestion  from  traffic  net¬ 
works  has  been  discus^  in  three  recent  papers  (7H*(.  In  these  papers 
the  problem  of  finding  reaf-time  dynamic  feiMback  routing  for  iraffic  is 
fonnulaied  as  an  optimal  control  problem  and  a  dcocntraKicd  siructure 
for  the  solution  is  utilised.  Gw  (7]  considered  a  single  destination  acyclic 
network  and  presented  a  solution  baaed  on  the  inieroannection  of  local 
oontiollers  which  implement  route  Klections  at  each  node  by  choosing 
among  K  alternate  routes  to  the  single  destination.  Sarachik  and  Oigikner 
(9|  considered  tingle  destination  cyclic  networks.  They  show  that  the 
optimal  local  routing  algorithm,  when  used  in  their  decentralized  strategy, 
leads  to  loop-free  flowrs  in  the  network  even  though  cyclic  paths  arc 
present.  Sarachik  (9)  considered  multidestination  a^fdic  networks  and 
presented  a  decentralized  strategy  for  that  cam.  Local  routing  strategics 
for  two  specific  simple  subnetworks  were  utilized  in  the  decentralized 
strategy  to  illustrate  the  control  of  a  larger  network. 

In  order  to  make  the  decentralized  routing  strategy  of  (I)  for  multi¬ 
destination  netwwrks  more  generally  umful,  this  correspondence  considers 
the  problem  of  designing  a  local  feedback  controller  for  a  typical  node  of 
a  multideslinaiian  network  when  the  node  hat  two  rail  links  (see  Fig.  I>. 
An  optimization  problem  it  formulated  and  solved  for  this  local  problem 

Manwotf*  wnmee  FeWnmv  21.  tsai;  icviMe  Aiisiwi  2.  ISSI  TMn  wnrk  «■»  mw. 
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na  I.  AlspWaaecaMllawaalilheiA 


when  the  demand  for  service  is  constant  It  it  shown  that  the  state  space  is 
divided  into  regions  separated  by  linear  manifoldt.  The  dosed  form 
conditions  for  the  optimal  control  are  given  for  each  region.  These 
conditiottt  do  not  specify  a  unique  optimal  control  to  a  specific  optimal 
realization  is  prtscnicd.  This  realizalinn  b  in  the  form  of  a  fe^ack 
control  which  cmi  be  easily  implemcnicd  in  real  time. 

In  using  any  dcccntraliaed  control  strategy  to  route  traffic  through  a 
complei  network,  decisions  asust  be  made  locally  at  each  node  on  how  to 
best  allocate  traffic  in  braacbm  leaving  the  n^  These  decisions  will 
affect  the  traffic  Icvsb  at  downstream  nodes  and  thus  must  he  based  on 
information,  provided  to  the  local  node  controllen.  about  conditiom  at 
other  nodes  of  the  network.  Quesiiom  concerning  how  the  communica¬ 
tion  betsreen  nodes  affects  the  overall  behavior  of  large  networks  which 
use  decentraliasd  rmitiag  are  eztreasdy  important  but  are  beyond  the 
scope  of  tins  paper.  Refctenoss  (7H*I  addicm  some  of  these  questiotH 
and  indicaie  how  local  node  eenttollm  can  be  utilised  in  a  decentralized 
dytianuc  routing  strategy. 

II.  Ibn  UscAL  Rourmo  PaonuM 

The  demand  for  service  u  thr  net  traffic  flow  arriving  at  a  typical  node 
along  inbound  Nnks  and  from  outside  the  network.  Thu  demand  b  met  by 
routing  the  incoming  traffic  to  the  eiit  links.  When  iraffic  demand 
cacseds  the  capadiy  of  the  ezil  Knks.  queues  build  in  front  of  the  node. 
When  the  capacity  ezoeeds  the  demand.  Ihc  queue  lengihs  will  be  de¬ 
creased.  In  this  paper  atieniion  b  restricted  to  the  situation  which  csistt 
lovmrri  the  end  9(  “rush  hours.'*  That  b.  congestion  cibts  in  the  form  of 
long  queues  but  the  demand  b  km  than  the  capacity  of  the  exit  links  so 
the  queues  can  be  emptied  stM  the  congcsiion  cleared  oul 

At  a  congcslcd  node  conlaiaing  queues  of  iraffic  for  M  destinations  and 
having  erdy  two  exit  links,  Ihc  queue  dynamics  be  represented  by  (I). 
01 

4"(»)»r--ur(t)-ur(r)  for  m- 1.2.  .M  (I) 
or  in  vector  form 

where  q"  denotes  a  continuous  state  variabk  which  approximates  the 
kngih  of  the  iraffic  queue  (measured  in  vehicles,  bits,  messages,  etc.) 
having  node  m  as  its  dMinalion:  u"  b  the  flow  rate  on  link  i  destined  for 
HI  and  r”  is  a  constant  flow  of  traffic  entering  the  node  destined  for  m(r 
is  the  demand  vector). 

The  problem  faced  by  the  node  controller  b  to  choose  the  control 
vevlon  u,  and  Uj  at  each  lime  instant  in  order  to  empty  the  queues  while 
optimizing  some  performance  measure.  A  useful  measure  of  this  perfor¬ 
mance  (7]-(  I0|  b  the  aggregate  cost  (or  delay)  given  by 

2  (9"(o-*-srvr(t)+s2-»-(t)M  (2) 

•  I 
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ill 


where  S,”  denotes  (He  eosl  (or  delay)  per  uni(  of  Do*  on  link  i  in  n.*aeh 
dcsiiiution  m.  TV  lime  T  is  any  lime  greaier  than  the  time  r*  reipiired  N* 
cmpiy  all  queues  il  iV  node  wiih  nuniinal  eitsi.  Nme  ihai  ihe  eoniohu- 
(iflii  lo  J  due  m  iV  q**  lerms  is  ihe  anrsiaie  delay  asMwiaied  wiih 
ckarini  IV  queues  1 1 1.  |S|.  while  ihc  iNher  terms  represeni  iV  aitre^ie 
COM  (nr  deteyl  assoeiaKil  wiih  reaehini  ihe  desiinaiuuis  (mm  iV  emianee 
of  IV  Mil  links  |I0|. 

Il  should  be  noted  ihai  wVn  ihc  local  cnniraUer  is  used  in  a  deuratral- 
iced  Kiiin|.  Ibe  coal  paramcicTs  S,"  wtH  be  campoMd  td  iwo  ponttma 
ST  *  C  *  *7  C  ■■  lx  datoyl  of  uaiwnini  link  i  in  dw 

first  downsiicain  nude  >  and  d*  is  a  aam  (nr  dalayl  aisnciaisd  wsdi 
rcadiii^  destination  m  from  nudc>.  TV  node  conimNeT  will  feeeivc  the 
inlormaiion  d*  from  iis  downstream  nriilibnei  and  update  die  values  S^. 
In  operation  IV  node  coniioller  muai  he  dble  to  ad^  to  iVae  updates. 
Tliua.  a  real-lime  icedbaefc  loluiion  of  iV  local  optiaaiuiion  problem  is 
csscniial. 

In  dut  problem  iV  queuet  can  V  emptied,  ainm  we  coniider  only  dw 
cam  far  whkli  dw  total  traffic  demand  d  wX;X.,/*cniefint  dw  nodeit 
lew  than  dw  combinad  capacity  of  dw  tarn  aait  linka  (Ci-t’C)).  TV 
optimal  noneandcMad  aaat  7*  can  V  definad  m  dw  miiumum  of  (2)  when 
dw  initial  ataw  jfia)  ia  aanL  Timiy  for  dw  same  r  m  in  (21. 

*-mml 

where  u,**  art  dw  optimal  nanaanpamad  llnwi  and  if  TV 

nwniaaum  coat  due  to  dw  coadoatian  at  dw  nada  ia  abtainad  by  ntbiraet- 
inp/*  from  dw  miniatum  of  (2).  Tins  liwm 

K(,(f,))-min(/(,)-J*l 

CfCl 

•  mi*  {'2  [f-(f)+5r(nr(»)-t.r)+^-(ur(f)-ur)]dr 

at'ft%aa) 

w 

where  dw  adnimiulisn  is  carrisd  out  au^aai  in  dw  aenatrainii 
sr 

j(r)»0:  f,(f)»0:  2  "."(OxC  loriw|,2;  r,<rxr 

m^t 

(5) 

and 

f(r)-« 

TVac  conatrainta  impose  dw  raqiiirmwnu  that  queue  Icmiha  and  (Iowa 
must  V  nonneiaiive.  ihai  each  link  has  a  finite  capacity,  and  dial  dw 
queues  must  be  empty  ai  iV  terminal  time.  Note  dial  lor  a  given  ^r,) 
and  any  r>r*  (fcoM  V  x)  iV  same  minimum  coat  K  ia  obtainH.  ao 
y  does  not  deptmd  on  iV  linw-iwfo  <r-if|.  Also,  system  (I)  is  time 
invariant  and  dw  coefTwienu  in  (2)  arc  oonatant.  TVrefore,  I'  ia  indepen¬ 
dent  of  dw  aiartind  time !«.  Applying  Hamilton- Jaoohi  theory,  srr  know 
that  dw  miiumum  cost  l'(|)  for  system  (Daaiiafimdw  partial  diflrrtniial 
cquaiion 

•"‘"I  2  [d'(o+Tr(-r(')-«r*)^v(ur(')-*r)l 

ri-n  (.Ai 

where  iV  minimiiaiion  is  suhieri  lo  consirainis  (5|.  Using  r*  ft  t] 
ihis  cqiuiirm  can  be  rcwniicn  as 


-«r*)]|=''  (^) 

liV  dependence  of  q**  and  «,**  on  r  is  noi  indicaicd  eapliciily).  Note  ihai 
liner  (7)  n  linear  in  dw  p,'s.  dw  minimum  lies  on  tV  constraini 

hmmdarwa.  and  siner  raw  of  thaw  bmmdarim  involve  sums,  an  optimal 
•afMian  nasd  nm  V  unique. 

•y  findiap  dw  ITfl  which  sadarwa  (7)  and  dw  boundary  condiiioiis 
impmod  by  iV  slaw  ewnitrainu.  we  can  obtain  capMcii  necessary  and 
mMann  canditions  for  dw  aptimal  flows. 

HI.  OrruuL  Flow  CoNDmoNS 

In  dns  taction  dw  eoaditiam  far  opiimal'flaw  arc  prcaaniod.  A  detailed 
deacripiien  of  how  ihaac  aandilioas  art  deriuad  is  giran  in  II If.  To 

nnmiify  dw  pnaaniaiian.  dw  dasdnaiians  are  nwuhersd  so  that  (or 

iL.*sr-sr 

With  this  mimhrwa^  iralfic  m  dmiiaaiiana  I  duough  K  prefers  link  I 
while  Mnk  2  is  prafiirad  by  traffic  m  detdnaiinna  K  *  I  ihreugh  M.  Also, 
if  il  haasnwa  nsonaaiy  m  dirart  traffic  fram  fink  I  to  link  2.  traffic  to  k 
diauld  nm  V  dmrtsd  until  all  traffic  m  m>k  hm  haan  diverted. 
Conusraaly.  when  traffic  must  V  dranaad  foam  fink  2  to  fink  I.  traffic  to 
k  diould  ant  V  divortad  unifi  afi  traffic  m  m  <  k  hm  boan  diviirtsd. 

It  is  canvoaisni  m  dsfina  dw  cacam  capaaitim  m 

Ti(fr)Ac,-  2  7i(ir)*c,-  2  -f’  (») 

«U| 


Ty(*)*maa[o.Ti<4r)-^2  u,-*]:  yt(*)-y-ya(k) 

farkwo.  ••.«r-i  (*) 

y,(k)»maa[o.T,(4r)-  2  -ri:  ya{*)-y-y.(*) 

I  — OAI  J 

brk-irt- (10) 

adwrc  T^TiflTKyifiO.  Note  duN  (•)  definm  dw  capacity  available  on 
each  fink  fmcfaariugqttitwa  after  dw  sicady-siaw  demand  is  accotnmod- 
sisd.  whan  an  ateady-iiaw  flitw  is  divirtsd.  When  C  •  and  C:  arc  each 
larac  enough  m  accimmaodaw  dw  pre lined  drmandi.  then  «r' s  r". 
of  ■  0  far  m  <  IT  and  or' >  0.  or* «  r*  far  m  >  K.  and  both  Ti(  I  and 
K I  are  greater  than  icro.  if  one  nit  link  hm  insufficient  capacity  C,  to 
accommodate  its  pngforrod  traffic,  ihea  m  much  preferred  traffic  as 
poswMe  is  touted  an  diis  link  (eriik  primitict  goocraed  by  tV  fl.)  giving 
y,(  A' ) «  0  and  dw  real  is  diverted  to  iV  other  link.  It  is  thus  quite  simple 
m  determine  dw  vahwa  o"'  far  all  m  and  i  far  this  problem  and  to  form 
iV  icu  T,  *|y,l0l.-  -.T,(kfH  far  1*1.2  using  (I).  (4).  and  (10)  TV 
y,(k|  siJ  yj(k)  vahwa  from  (4)  arc  IV  caoeas  capacities  available  (or 
emptying  queues  when  demand  for  dcaiinaiiona  (k  1)  to  A'  is  diverted 
from  link  I  to  2.  wherem  iV  vahwa  foam  (l(l|  ate  iV  eaoeat  capacities 
when  dw  demand  far  destinations  A  t- 1  m  k  is  diverted  (ram  unk  2  to  I. 
Diversion  of  traffic  from  one  fink  to  another  can  eventually  use  up  tV 
capacity  of  tV  fink  rtorivtng  dw  divertod  traffic.  To  keep  track  of  this,  we 
define 

4  ft  dw  imsHcit  value  of  k  such  that  r:(  k )  >  0 
k  ft  iV  larftai  value  of  k  Mich  that  ri(  k )  >  0 
Aftmin|A.k|:  4(^mu(A.k|  (M) 
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A  SCI  o(  threshold  consunis  can  now  be  defined  recurMvely  bv 

2f,(ir)  =  2f,(*T|)*y,(*)(^,-^*.,l  .1  (12) 

with  2f,(  AT  -r- 1 1  ,  T,(  If  Hso  .ir,(  4  )  >  0  i.i  nondecreasini  iis  k  decreases 

from  K  lo  I )  and 

lf,(*)  =  2f,(*-l)i-Y,(*-l)(4,-,-^.I  for*  =  lf  >1.  .W 

(‘J> 

with  X^K\^  -  figy^K)i*o  X^k)*<i  ii  iMndecrcasin|  aa  k  increaan 
from  IT  I  to  M).  Compoaiie  qtictMs  are  defined  at 

t  w 

2  2  9*'  =  <?-ei(*)  ('«) 

■t- 1  «  >1 «- 1 

where  (?  is  the  turn  of  all  queue  kntiht.  The  oompoMic  flows  on  each  link 
are 

a  tr 

t/|(*)*  2  2  ('*) 

•••l 

Note  that  k^O  means  0|(O) - 0  and  (/|(0)  =  0  while  i  >  M  means 
Q^M)  =  0  and  V^M)  =  0:  for  notaiional  convenience  the  explicit 
dependence  of  these  quantities  on  /  hat  been  omitted.  The  upper  flow 
bounds  are 


When  ihe  siaie  vector  q  lies  in  a  region  optimal  flows  must  satisfy 
the  condition 


f/,(Ar)  =  t/,(*)=  J 


Ut(0)‘U,(k) 


(k)  i!Q,(k)=0 

otherwise 


if(?,(*)=0 

otherwise 


and  the  lestrictiao 


e,*  +  «|'«r“  whenq*'  =  0. 


Note  that  (/|( Ml  =  (/|(4)  means  that  u,**  =  0  for  m  >i  and  f/^fO)  =  (/jf 4 ) 
meant  tr}*  =  0  for  m  « Ir. 

If  the  suie  vector  «  lies  in  a  fC|ion  then  optimal  flows  must 

latisfy  (21a)  and 


ifC,(*-i)  =  o 

otherwise 


C,(k)^minlc,.  2  r*|:  C,(*)4min|Cj 


sr 

.  2 
w-S-sl 


The  solution  of  the  Hamilton-iacohi  aquation  (7)  shows  (li|  that  the 
admissible  state  space  q  >  d  is  divided  into  disjoint  regions  which  fill  the 
space.  In  each  region  l^(  j)  is  |ivcn  by  a  different  quadratic  function,  but 
it  is  continuous  across  the  legion  boundaries-  These  rctiont  can  be 
expressed  conveniently  in  tentit  of  variables 

r,(0*tfi(*)-^<?j(*)  derinadlor*<* 

r, (*)*«, (*)-^0i(*)  dcf«edlor*<*  (17) 

The  regions  arc 


X,(k  +  l)<T,(k)^X,(k)  fort<k<min[ir-l.k] 

=  .  q: 

Xi(k)<Ttik)*X,(k  +  \)  for  max(fT+l.  *)«*<* 

(••) 

In  ( 19)  (recall  that  K  it  the  number  of  destinations  which  prefer  link  I).  if 
IT  =  0  the  upper  bound  it  tern  but  if  JT  >  Af  the  lower  bound  it  aero. 


as  well  as  restriction  (2lc|.  Note  that  actually  the  only  difference  between 
theae  oonditimit  it  that  (2ld|all*ws  a,  to  be  nonxero  (this  it  rrqwm/onlv 
•hanflif*  - 1|  w  0)  so  traffic  to  destination  k  could  be  touted  via  both 
Knks  I  and  1.  Since  these  conditions  do  not  specify  the  optimal  flows 
uniquely  any  implsmentatian  which  it  convenient  win  be  optimal.  These 
condilioas  arc  necessary  and  tuffieieni  for  optimality  because  they  were 
obtained  from  Kff  |  which  taiislies  the  Hamihon-Jacobi  equation  (7)  and 
boundary  conditions  imposed  by  the  suic  constraint  q>0.  (Reference 
III)  gives  expressions  for  the  minimum  cost  t'(f)  in  Mch  region  and  a 
proof  that  these  regions  nil  the  admissible  state  ifct  q  >  •) 

The  region  in  which  q  lies  could  be  determined  by  using  a  micro- 
pmeessor  or  a  special  chip  which  implemenu  the  conditions  ( lll)-(20)  in 
paraRcl  by  comparing  the  vector  of  test  variables  against  the  vector  of 
dMuttbild  values. 

IV.  A  Snanc  OmsiAL  Routing 

In  the  preceding  section  the  general  requirements  (21)  permit  many 
optimal  routing  strategics.  In  thu  section  a  spcciHc  optimal  routing  is 
pmentod  which  satisfies  them  general  requiremenu. 

For  Mates  lying  in  the  regions  R|t  coruidcr  the  (lows 


a)  ur*“r’  +  er"  +  Yi(k)^^.  ■"  =  **  forl<in<* 


b)  »r-o.  ur“«r'+ur'+Ti(k)- 


for  k  + 1  «  m  <  M 


T,{k)>X,{k)  1ork^i*K 

r, (*-!)<  Jf,(*)<r, (A)  fof*<*«jf 

r,(k-l)<0  for*w*  +  l«Af 

T,(*)«0  for**i»Jlf  +  l 

Ti(4)«  -I)  totK<k*k 

T,(k-\)>  Xi(k)  for*=4+l» 


for  *  =  4  +  I  >  R  +  I 


The  region  in  which  q(/)  lies  and  the  index  value  4  ts  obtained  hy 
determining  which  of  the  inequalities  (Ik).  (19).  (20)  the  state  vector  q(r) 
satisfies. 


where  for  0|(  4 )  =  0  or  4 )  w  o  the  indeterminate  term  0/0  is  defined 
to  be  icro  (thus  (21c)  is  utisTicd  when  q**  =  0).  We  now  verify  that  (22) 
satisfies  the  requirements  (2la)-(2lb).  For  Q,(k)>0  and  ^;(4)>0. 
summing  (22)  gives 

s  s 

■)  t/,(4)=  2  2  -r‘+Tt(*) 

M  «  I  M  •  I 

b)  t/,(4)*  2  2  -r'^7,(4).  (23) 

•■1*1  •■S*! 

Now  y,(4 1  >0  and  y^4  )  >0  (this  is  necessary  for  X^^  to  exisii  so  for 
4  «  R  we  have  u"'  =  0  (or  m  «  4  and  u""  =  0  (oi  m  »  R  +  I  Thus. 


•)  u,(k)=  2  2  -r‘=c. 
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K  M 

b)  (/,(*)=  2  1  »r'  +  7i(*)-Ti(*) 

M 

♦  2  “"'=^1  (i*) 

«sA  4  I 

For  k  >  K  this  is  demonstraied  in  i  simiUr  muncr.  WBim  0 

ihc  T|(  k )  icrm  docs  not  appear  so  (23a)  gim 

k  k 

2  (-r+ir*)*  2 

M  •  I  «  ■  I 


havt  juki  dcmoiikiraicd  ilui  in  ihik  cane  ihe  comptisiic  no»k  (261  art 
npliiiHini  B>  dcfiniiKW  (I4|  for  the  compitsiii'  kuick  note  iHai  if 
Q;t  k  -  1 1  =  0.  Ihen  y ,( m )  =  0  and  p,(  m  1  =  L  -  1 1  fiw  all  m  >  L  -  I 
For  k  >k.  (2(1)  requires  iha(  tj<*  -llp|(*  l)<  -  T|(A  -  l)JT;(i  ) 

which  Ik  ckarlk  impossible  kince  C^i(  4  -  1 1.  r,(  4  -  I ).  and  .V,(  4  i  cannoi 
be  ncfalivc.  For  k<k<K.  (20)  requires  ihe  impossible  condition  ^,(4 
-  I)«  A',(*l''0i<^  ”  conclude  ihcrelorc  ihji  a  kiaie  y  wiih 

P}(4  -l)  =  0  can  saliifs  the  conditions  (20)  tor  ft^  ^  ,  only' when 
t  =  k  <  K  A  similar  ar(ument  shows  ihai  when  ^,(4  )  -  0  the  stale  can 
be  in  |  only  when  *  =  4  +  I  >  If . 

V.  CONCLUUONS 


since  ti(A|>0.  SimiUrly  when  ihe  ritkl  doc*  Mi  appear  in 

(23b> 


at 

■i*t«i 

For  SUMS  lyint  in  (he  it|ioM  ■>**  liiuaiion  is  eem^caicd  by 

deteneracies  whieh  can  occur. 

A  oompoaiiecapresaianwIuctiiimapiinMl  floM  tor  jin  Bt.a.)  is 

.)  ,r-.r-*.--*r,(.)54i5:  .r-o 

b)  «f  =  «f+r,(*)^^+A((i) 

d)  -r»o; 

tor46l«««M  (25) 

where  the  indeiemiinaK  icmi  0/0  is  defined  lo  be  icra  when  either  (?•(*) 
or  (?]f  i  - 1)  isicroao  a|ain(2lc|  is  satisfied  when  4"*0.  ThesynilMis 
used  in  (25)  arc  defined  ee 

r,(*)*Ti(*-i);  r,(*)iTi(*-i) 

A(*)*-  Tj(*);  A(*)*-ur  (or4#i 

ri(*)*T.(*);  r,(*)*T,(*) 

A(4-e|)*Ti{*);  latk^k*t 

We  now  verity  tbei  (25)  gives  U,(k)^C,  and  Uf(k  -t)=-Cf  which 
satisfies  die  conditions  (21s).  (2ld)  tor  an  ^timum.  When  4  =  4  >  X  or 
4  =  4-fl«Xor4isinthc  range  4  <  4  <  4  ■)■  I.  the  r,(4 )  arc  positive  so 
the  result  can  be  shown  ciactly  as  before.  When  4  -  4  <  X  and  0i(4)  >0 
we  find  by  summing  (25)  that 

»)  t/i(4)=  f  «r*+ 2  •<r+7,(4-i)-T,(*) 

W  M 

b)  Mi(4-i)*  2  “T’*  2  •'r'  +  Tj(4-')  +  r,{4)  (26) 

«■  t 

Since  yi(4)>0  we  have  ul"'  *  0  for  4  »  X-H  and  since  tj(4  -  l)  =  0  we 
have  Ti(  4  ~  ~  Y  >  0  and  ir5*’  =  0(orm«4-l.  Thus  using  (I)  and  (9) 

a 

t/i(4)=  2  •'."■•►y-t,(4)-C, 

fit  ■  I 

tC  k0 

i/,(4-i)=  2  2  T,( 4) *c,  (27) 

«•« 4  ♦  I  M« 4 

TTiik  result  can  be  obtained  in  a  similar  manner  for  4  =  4  ■*- 1  >  X 
To  show  ihai  (25)  is  opiimom  when  Q^tk  -  I)  =  0  we  wilt  show  that 
when  this  happens  the  state  q  can  only  be  in  Xi  « . ,  with  4  c  X.  and  we 


tor4>X 


ror4«X 


The  problem  of  finding  an  optimal  routing  strategy  for  local  node 
controllers  in  a  multideslinalion  network  has  been  solved  for  the  case  of 
nodes  with  two  csit  links  and  constant  demand.  It  hat  been  shown  that 
(he  admissible  state  space  it  divided  into  control  regions  separated  by 
linear  manifolds  and  eaplicit  expressions  for  the  boundaries  of  these 
regions  arc  presented.  To  each  region  there  exists  a  set  of  controls  which 
are  optimal  for  the  slates  of  that  region  A  specific  feedback  realixalion  of 
•he  optimal  control  has  alto  been  preKnted  The  implementations  (22) 
and  (25)  tor  the  optimal  flows  arc  not  the  only  ones  possible.  They  are. 
however,  conveuient  tor  traffic  control  purposes  bcrausc  they  provide 
tome  exit  capacity  to  traffic  for  each  of  ilte  destinations  and  t)ie  longer 
ihe  queue,  the  larger  will  be  its  share  of  the  capacity.  They  can  alsr^  be 
readily  implemented  using  a  microprocessor  since  they  consist  merely  of 
the  sum  of  tieady-tute  terms  and  a  linear  feedback  on  the  ratio  of  a  state 
variable  «**  to  ilie  composite  state  Q,  of  which  f**  it  a  pan.  The 
steady-state  values  mutt  be  available  at  the  local  controller  either  by  local 
computation  or  they  must  be  supplied  by  a  central  supervisory  controller. 
Note  also  that  if  either  the  demand  or  the  exit  capacity  changes  to  new 
contunt  values  then  tite  y,  vectors  change,  so  that  the  boundaries  of  the 
control  regions  and  the  feedback  gains  change.  This  merely  requires  a 
rcdcterminaiion  of  Ihe  region  in  which  q  lies  The  feedback  routing  given 
is  therefore  adaptive  to  changes  not  orQy  in  demand,  but  in  capacity  as 
well. 

It  should  be  noted  that  the  solution  for  r  constant  may  not  oc  as 
restrictive  as  it  appears.  For  example  r  could  represent  the  average 
demand  over  some  interval  so  it  would  be  constant  on  (hat  interval. 
Furthermore,  the  routing  (22)  gives  constant  flows  v^tr)  at  long  at  the 
state  remains  in  one  region.  Tlie  routing  (25)  gives  almosi  constant  flows 
in  one  region.  Thus  even  in  decentralixed  operation,  when  all  nodes  use 
this  local  strategy  the  incoming  demand  can  be  considered  at  piecewise 
constant  over  intervals.  The  feedback  routing  could  track  the  changes  by 
updating  the  y,  vectors. 

For  data  communication  networks,  tlie  problem  solved  here  could  be 
applicable  when  there  are  delays  or  costs  associated  with  .vending  mes¬ 
sages  to  the  dcsiinaiwns.  Of  course,  it  it  usualK  not  desirable  to  mix  the 
messages  for  the  various  dcstinatioru  on  a  single  link,  to  a  different 
specific  implementation  would  be  preferable  in  such  networks. 
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Ahotrict 


A  dyaaaic  altonuco  routt  itratofy  la  liovolopod 
fee  a  typical  nod*  of  a  traffic  cetwork.  The  strate¬ 
gy  coaparaa  the  queue  length  at  the  node  agaioat  aa 
optiaul  threshold  value.  «heo  t.he  threshcld  is 
exceeded,  soac  of  the  traffic  is  diverted  along  an 
altacnate  arare  coatiy  route  to  ita  deatioation. 

The  local  node  level  atrategy  is  eaaily  la- 
pleaeated  ia  real  tiaM  and  ia  also  adaptive  to 
chaagea  ia  route  capacities,  input  rates  or  the  rel¬ 
ative  delays  along  the  alternate  route.  It  ia  also 
suitable  for  uae  in  a  decentralized  routing  strategy 
for  large  networks. 

Introduction 

The  store  end  forward  network  aodel  has  been 
found  to  be  very  useful  for  probleaia  concerned  with 
■owing  traffic  through  congested  networks  froa  origin 
to  deitiaation  nodes.  With  this  aodel,  whenever  the 
instantaneoua  traffic  demand  exceeds  the  capacity  of 
the  network  to  use  the  deisand  for  service,  congestion 
builds  in  thn  network  ia  the  fora  of  queues  which  are 
stored  at  the  network  nodes.  For  a  fixed  network 
structure,  the  problea  faced  at  each  node  is  how  to 
choose  the  routes  along  which  to  send  traffic  on 
toward  its  destination  while  ainiaixing  soae  aeasure 
of  cost  or  delay  and  satisfying  the  capacity  con¬ 
straints  of  the  network  links.  For  stochastic  traf¬ 
fic  deaand  aost  of  the  recent  work  has  eaphasixed 
dimaaic  or  adaptive  routing  scheaes  (l-o]  which  use 
Inforaation  about  the  actual  state  of  the  node  or 
network  to  deteraine  the  best  routing.  In  alaoat  all 
'this  work  a  node  with  I  routes  to  the  destination  is 
awdellcd  as  L  parallel  N/M/1  queues.  The  routing 
decision  is  aade  at  the  tine  a  custoaer  arrives  by 
placing  hia  in  one  of  the  L  queues.  Foschini  and 
Salx(2y  show  that  for  high  traffic  deaand  a  Join- 
the-shorteat-qiieue  rule  gives  average  queueing  delays 
approaching  that  of  the  M/M/L  queue  (i.e.  average 
delay  is  reduced  by  s  factor  of  L  coapared  to  equal 
splitting).  For  two  sltemate  routes,  Boerstyn  sod 
Livne  (6J  also  get  perfomance  approachiog  the  H/H/2 
case  by  adding  a  third  queue  which  c.in  use  either  of 
the  two  servers  when  the  dedicsccd  queues  bccoae 
eapty. 

It  would  seen  that  a  aore  direct  approach  to 
achieving  such  reduced  queueing  delays  at  the  a  node 
is  to  operate  the  node  using  a  single  queue  with  L 
possible  servers.  The  routing  decision  esn  then  be 
put  off  until  the  aonent  a  custoaer  gets  to  the  need 
of  the  queue  sad  aust  be  put  into  service.  In  Fig.  t 
the  box  0  deootes  s  decision  rule  which  allocates 
traffic  to  the  route  servers.  The  decision  can  t.hen 
be  aede  an  the  basis  of  the  latest  state  lafomation 
atid  also  on  the  delays  to  be  encountered  along  the 
different  routea  (l). 

This  problem  has  been  solved  for  the  esse  of 
deterainiatic  input  denand  (7,8|,  where  it  was 
found  that  the  optimal  routing  strategy  is  s  thres- 
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hold  stratsgy.  The  purpose  of  this  paper  is  to 
investigate  the  threshold  strategy  for  a  discrete 
queue  dynemic  model  with  ranoom  Fottann  arrivals. 


Fig.l  A  single  queue  alternate  route  model. 


A  Threshold  Strategy  for  Local  Alternate  dnutieg 

Consider  any  source  node  and  a  single  desti¬ 
nation  node  with  L  routes  between  them  indexed  by  i 
•  1,  ...,  L.  The  problem  is  to  assign  traffic  to 
these  routes  ia  order  to  minimize  the  average  delay 
in  reaching  the  destlaetioe. 

We  astume  that  traffic  arrives  at  an  average 
rate  X  with  Poisson  iatcrarrivsl  times  and  that  the 
langtb  of  each  unit  of  l.-affic  is  expoaentially 
distributed  with  mcea  l/p.  (For  neesagct  cb«  units 
are  in  bits  and  for  vehicles  the  units  are  in 
feet) .  All  traffic  arriving  at  the  node  it  as¬ 
signed  to  s  single  queue  and  Che  dcclsicn  on  which 
route  to  use  is  made  only  when  it  reaches  tie  front 
of  the  queue.  We  have  then  Che  situation  of  s 
single  queue  with  many  potential  servers  as  showc 
ie  Fig.  1.  Letting  Q  denote  the  total  nuaunr  of 
custoaers  (aesseges  or  vehicles)  waiting  in  the 
queue  and  in  service,  Little's  foraula  TQ  = 
gives  the  average  time  spent  on  Che  queue  and  in 
service.  Furtberoore  all  traffic  using  the  . 
route  will  encounter  sa  additions!  average  delay  at 
S.  before  reaching  the  destination.  Thii  sddi' 
tionsl  delay  includes  both  transit  delays  if  aoy 
sad  queuing  sad  service  deliys  at  nodes  alcn;  the 
route  to  the  destination.  The  average  total  delay 
CO  reach  the  destiostion  it  thui  given  by 

T  *  r-  t{Ql  *  X  3.  (1) 

A  A  i^j  1  1 


Where  X.  is  the  average  rate  at  which  traffic  is 
tent  along  route  i.  For  a  deceninislit  version  of 
this  problea  with  coastaat  a.-riv.il  cite,  the  opci- 
aiel  routing  strategy  was  found  to  be  a  chres.'iol.i 
strategy,  (f]  That  is  to  say  when  the  routes  are 
ounbered  so  Chet 


<£,<...<  (2) 
then  the  optucal  strategy  is  to  use  louce  i  at  foil 


capacity  if  aod  only  if  Q  aacaada  a  tbraihold  K. 
(whaca  0  «  K.  <  K,  <  . . .  <  K, ) .  Thit  aaana  taat  far 
<  Q  <  only  the  firat  i  routes  sbould  be  used. 

For  very  aaall  Q  oaly  route  ]  (the  one  with  saallest 
delay  la  used  aod  as  Q  increaacs  the  next  best  alter¬ 
nate  routes  are  added.  Work  is  presently  beint 
carried  out  to  prove  the  optiaality  of  thit  type  of 
strategy  for  the  stochastic  problen  considered  here. 
On  a  heuristic  basis  one  would  expect  a  threshold 
strategy  to  be  optiaal  since  the  first  ten  of  (1)  is 
satfll  when  all  routes  arc  used  and  the  second  tem  it 
■iniaixed  when  only  route  1  is  used.  For  very  ssmII 
valuta  eif  Q  it  ahould  be  best  to  use  oaly  route  1  but 
when  Q  reaches  a  level  at  which  the  increased  cost  of 
using  the  next  best  route  2  it  offset  by  the  anount 
that  0  can  he  reduced,  then  route  2  should  also  be 
used.  The  sane  rationale  for  larger  values  of  Q 
leads  to  the  threshold  strategy  outlined  above.  Thit 
is  the  type  of  routing  policy  which  ia  coasidered  in 
the  following  diacuaaion. 

Substitute  X.  B  It.  F.  into  (1)  where  p.  *  the 
service  rate  for  Foute  *i  (V.  *  M  C.  where  ia  the'' 

capacity  of  the  first  link  in  route^  i)  and  ia  the 

probability  that  route  i  ia  used.  We  seek  to  aini- 
aixe  the  cost 


where  7(k)  j  Prob(Q  *  k|.  liote  that  oaly  route  I 
(server  1>  it  used  (or  1  £  Q  ^  K  whereas  both  routes 
(servers  1  and  2)  arc  used  for  Q  >  K»l.  The  set  of 
equations  (<•)  are  easily  solved  to  give 

P(0)  for  C  <  k  <  K 

P(k)  «  \  K  r  '  ~  <5) 

where  y,  ^  VMj  ‘Bd  Yj  ^  V(Mj  ♦  Mj).  Suaaing  (5) 
over  all  k  and  equating  to  1  gives  for  ^  ( 

1  (1-Y,) 

A/dh\  ^  _ _  —  _ _ _ * _ _  /^\ 


where  e  ^  (Y,-Y2)/0-Y2)- 

Since  route  1  is  used  whenever  ()  >  0 

„  S 


Yi-«Yi 

P,  •  1  -  P(0)  «  -i - ^ 

l-«Yi 


for  Yj  P  1 


J  ■  XT  a  I(Q)  4  2  S.  M.  P.  .  (3) 

im  ‘  ‘  * 

by  choosing  the  threshold  levels  K.  <  S.  <  ...  <  Kr* 
Vote  that  K.  ■  0  because  whenever  *<)~>  O^soac  service 
auat  be  provided  so  route  1  (defined  as  the  route 
with  the  satalleat  delay)  should  be  used.  The  rela¬ 
tion  between  the  cost  and  threshold  levels  is  cost- 
plicated  In  the  general  ease  of  L  alternate  routes  so 
we  will  proceed  with  the  case  of  I  e  2  to  denonstrate 
the  approach. 

TWo  Alternate  Koutes 

We  firat  will  consider  a  sisiplified  situation 
-  (adnittedly  unrealistic)  where  awapping(S)  is  allowed 
between  the  two  servers.  That  ia,  if  server  2  is 
busy  when  server  1  coaipletes  a  service  which  results 
in  Q  <  K  then  the  cuatesMr  being  served  by  server  2 
has  this  service  cowpletad  by  server  1.  This  results 
ia  the  one  diaMnsional  state  transition  disgraw  shown 
ia  Fig.  2.  When  swapping  is  not  pcrsitced  (NS)  then 
any  service  begun  by  a  server  nust  be  conpleted  by 
the  sane  server.  The  threshold  strategy  leads  then 
to  the  state  transitioa  diagraai  of  Fig.  3  where  the 
states  labled  (ka)  denote  that  k  custowers  are  in  the 
systen  hut  only  server  1  is  being  used  to  provided 
service. 

(EOIOST'  •  •  ^J£0£C  ■  ■  ■ 


rig.2  State  transition  diogram  for  S  model 
using  threshold  K. 

For  Fig.  2  we  find  that  the  S  wodel  is  charac¬ 
terised  by 

XP(k-l)  •  Mj  P(k)  for  I  <  k  <  K 


XP(k-l)  e  t**'  If**  1  ** 


tquations  (6)  and  (7)  and  those  to  follow  are 
valid  far  >  1  as  well  as  <  1  provided  Yj  <  1. 

Analogous  equations  for  y^  *  1  can  be  easily  ob¬ 
tained  for  the  entire  developaent  presented  here. 
Noute  2  is  used  only  for  Q  >  K,  so  for  Y^  P  * 

K  (Y,-®)Y,* 

P,  ■  1  -  2  P(k)  •  — i - ^  (S) 

'  kwO  l-dYj 


Using  E(q)  •  Z  kP(k)  with  (S)  and  «)  gives  for 
k-0 


Yj  t  *. 


’  Y,  /y,(i-y,)  Y,  \  rl  1 

EIQl  -  —  *(-2 - 1 - K-  ItojYi  - 4 

Li-Y,  \(*-Y2)"  (l-Y^)^  /  J>-V 


Let  X  denote  the  routing  cost  associated  with 
(3).  Then  (7)  and  (8)  give  for  y^pl 


Sj  M,  P,4Sj  Mj  Pj 


S,M^(yi-WY;)*S^>i2(Yi-n)Yi 


We  observe  that  as  K  ■*  •,  E{Q}  in  (9)  in¬ 

creases  nonotonically  and  R  in  (10)  decreases 
sMnotonicslly  so  there  is  sowe  value  N*  for  which  J 
w  E(Q)  4  R  has  a  ninisiun.  Thit  value  could  be 

found  by  a  direct  search  or  its  approximate  value 
can  be  detemined  by  finding  expressions  for  the 
changes  AE{Q)  and  AR  as  functions  of  K.  The 

eptinua  value  K*  will  be  an  integer  near  the  value 

K  at  which  AE  4  a  R  <  0.  In  this  case  we  find. 


i  *  «  (Sj-S,)(M,-X)  4 


Mi4P2*X 


When  y.  <  I  then  neglecting  the  second  tens  on 
the  left  of  (11)  gives  a  starting  point  for  a  one 


■t 


1  <  k  <  K 


(17) 


4lM«aioaal  taarch  (or  K.  IAmo  Y<  *  1  aiocc  tkc 
toraa  oa  tka  ri|bt  ara  all  aagat^c  wa  (lad  that 


i  > 


- 

la  |(S,-S,){X-M,)  ♦  -  ♦  >  \ i 

A  ^*‘*1 


Y, 


(12) 


aa  Cha  right  aida  a(  (12)  caa  ba  uaad  to  atart  a 
oaa  dlMaaiaaal  aaarch  for  K*. 


Fig. 3  Stott  trontitlon  diogrom  for  NS 
inodtl  with  thrtahold  K. 

Tar  tka  VS  aodal  whara  awappiag  la  aat  allawad 
aa  (lad  (raa  Fig.  3  that 

(a)  X  KO)  ■  M|  T,(l)  ♦  Mj  P(l) 

(b)  X(t(k«l)ap  (k*l)]  ■  M,(f(k)'»p.(k)]  ♦  p,  p(k) 

far2<k<I*  *  *  * 

'  "  («) 

k 

(e)  Xt(k)  «  Z  t(J)  ■  M.  T(k+1)  (or  1  <  k  <  V 
*J«1 

(d)  X  p(k)  ■  (Vj-nij)  T(k'rl)  far  fal  <  k 
(a)  X  (p(K)  ♦  p^(K)l  «  (M,n«j)  P(V^I) 


ahara 

a 

p(k)  ■  Prab  (Q  ■  k  aad  aaraar  2  la  buay) 

f^(k)  a  Prab  (Q  ■  k  aM  aaraar  2  la  aat  buay) 

It.  (13d)  caa  bo  aalaod  ta  glaa 

P(k)  a  p(Kai)  far  k  >  *♦!  (U) 

ahara  p(K^l)  la  obtalaad  frea  (13a)  aaca  p(K)  aad 
a  (K)  ara  kaewn.  Eguatioaa  (I3a-c)  giva  2X  oquatlena 
far  tha  raaainlag  (EX'*])  uakaowa  P(0),  p(l),  ... 
P(«,  P,(l).  ...  P,(K). 

Tha  addltloaal  coadltloa  aaadad  ta  aolaa  la  aa 

aaual 


■ 

X  Kk)  a  1  (15) 

kaO 

Aa  aaalytleal  aolutioa  of  theaa  aquationa  laada  to 
eoapllcatad  aapreaaiona.  Howavar  a  conputatioaal 
•alutloe  la  aaaily  obtalaad,  ainca  (13c)  caa  ba 
aalaad  Itarativaly  ta  give  p(2)  ...  p(Kvl)  la  tanaa 
of  p(l).  That  la  if  wa  dafiaa  tha  ceafflciaota  b^  by 

P(k)  g  b^  p(l)  far  1  <  k  <  (♦!  (16) 

thaa  ualag  thia  la  (13e)  givaa 


’k*l 


h  ^  *  h 


k 

X 

J-1 


'b,  far 


whara  b,  «  1  aad  y*  t  M./M,-  Or  for  aaaiar  cal* 
culatia^  dafiaa  ^  >  2  i 


k 

d^  A  Zb,  aad  Iterate 
*  ■  J«1  J 

*k  •  Vi  ♦  ^ 

‘kal  “  ^1  ^k  *  »3  *k  *•*  *  <  k  <  K 

atartlag  with  d^  ■  0,  b^  ■  1.  Alaa  alaca  P(k)  *  p(k) 

a  p.(k)  (Vau  P(k)  >  p(k)  far  k  >  Wl)  aq.  (13b) 
wltr  (16)  givaa 

Kk)  •  Yj  Kk-l)  -  Yj  b^  p(l)  far  2  <  k  <  K  (19) 
wbara  fraa  (13a) 

P(l)  ■  Yj  KO)  ♦  (l-Yj)  p(l)  (20) 

Tha  aalutlaa  of  (19)  caa  ba  wrlttaa  aa 

Kk)  •  y,^^‘*^K1)  -  Yj  ^ail 

far  k  >  2 . K.  (21) 


aa  by  daflalag 


*  Jal  ‘ 


for  1  <  k  <  K 

(22) 


we  caa  Iterate 


■kai 


^1  -k 


*  ^k41  f*'  *  5  k  <  K-1 


(23) 


atartlag  with  a,  ■  0.  Thua  wa  oaa  that  (20)  aad 
(21)  bacoM  ‘ 

F(k)  »  y/  KO)  ♦  y/^’^^J-Yj)  p(1)-Y3\  P(l) 
t  <  k  <  I. 

Tbua  all  the  P(k)  caa  ba  cspraaaad  la  tana  of 
p(l)  aad  P(0).  But  p(l)  Itaalf  caa  be  azpreaaad  ia 
tarwa  of  P(0)  bccauaa  (13a)  aad  (16)  gives 

Yj  P(K)  a  p(K^i)  .  bjj^j  p(l)  (25) 

aa  ualag  P(K)  frea  (24)  la  (25)  givaa 

Y  ' 

p(l)  »  - - - =7r -  KO)  ft  PP(0) 

^^ral^’2^  ^  ^3  ^'*'*3^  (26) 

Tbua  all  tha  P(k)  ara  erprctaiblc  ia  tanas  of  P(0). 
Wban  these  arc  laacrtad  iato  (15)  we  fiad  that  for 
Y,  P  1. 

'  1 
P(0)  a  - 5- 


!2l 

>-Y, 


♦  P 


r,.,(K-l) 

Hfj 


>-Y, 


Vi  1 

(1-Y,)-Y,  X  \ 

^  Vl  Y2(J-Y2)J 
(27) 
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where  0  it  detiaed  by  (26),  to  til  other  probtbili* 
tiet  are  detervioed.  Here 

P,  ■  1  -  P(0)  -  p(l)  »  1  -  (lep)  P(0) 

‘  (M) 

K  K  K 

P,  «  I  -  P(0)  -  2  p  (k)  «  1  -  I  P(k)  ♦  I  p(k) 

*  k»l  •  k»0  k-l 

•ad  at  before 

E(Q}  >  2  k  P(fc)  (2P) 

kaO 

*  *  ‘‘l  ^l  *2  ^2  ^2 

caa  thea  be  calculated. 

laaaale  1 

Aa  eaaaple  with  |kb  30,  m.*  10,  K*  25  aad  S.*  1, 
2  wat  aaalyaed.  Ae  returtt  are  thow  ia  Fi|.  4 
nr  both  the  S  aad  NS  aedelt.  la  that  eaaaple  E(Q} 
iacrcatct  aoBotoaically  with  thraihold  E  for  both 
aedela.  For  aett  walwet  of  E  the  NS  tMdel  givet 
•lightly  lower  ealuet  because  the  aecoad  tarver 
operate*  aero  of  the  tiae  thaa  for  the  S  aodel.  The 
appareat  iaconaittaacy  for  Eal  it  readily  eaplained 
by  the  fact  that  ia  the  NS  aodel  it  ia  pottihle  for 
the  fatter  aerver  (ia  thia  case  server  1)  te  be  idle 
while  the  slower  aerver  ia  busy.  Fer  low  values  of 
threshold  this  happeat  aore  oftea  with  the  result 
that  E(<))  it  raised.  Exaaple  2  will  preseat  a  aore 
strikiag  illuftratiea  of  tbit  effect  ia  which  EiQ) 
will  aot  be  aoaotoaic  hut  will  actually  ditplay  a 
aiaiaua.  At  E  •  both  aodels  approach  the  H/H/I 
case  siace  the  secoad  server  is  aot  used,  to  lia  E{Q) 
a  a  5.  The  routiag  costs  E  decrease  aoaoto* 

aically  with  threshold  E  at  the  aore  costly  route  2 
is  used  less  frequeotly.  At  expected  NS  hat  larger 
values  for  E  thaa  S  does  becauae  NS  caaaot  switch  a 
enstoaer  beiag  served  by  server  2  to  server  1  Whea  Q 
Calls  below  tba  threshold  value.  As  E-Mi.P.  1,  P, 

0  so  E  ■*  23  fer  both  aodels.  The  total*  cost  J*i» 
sbowB  is  Fig.  6b.  NS  gives  Jaia  a  29.197  at  E*  a  9 
aad  S  gives  Jaia  a  29.166  at  E*  a  7  but  the  aiaiaa 
are  fairly  bread.  Ia  this  exaaple  usiag  the  siapler 
S  aodel  to  fiad  the  aiaiaua  would  give  perferasace 
iaperceptably  differeat  thaa  usiag  the  aore  realistic 
NS  aodel. 


I  S  10  IS  ■ 
Pio  Ok  TaltICail  «w  tuaiU  1 

Aa  X  decreases  we  expect  that  E*  will  iacreate 
siace  server  1  caa  aore  easily  serve  the  deaaad  by 
itsalf  aad  at  tasw  poiat  E*  will  be  iafiaite  (i.c. 
it  is  best  te  use  ealy  route  1).  As  X  iacreaset 
server  1  requires  aore  help  ia  heldiag  E|Q)  dewa  so 
E*  decreases.  Is  fact  fer  Pj  £  X  <  Pj  «  M2  server  2 

aust  be  used  aeae  of  the  tiae  te  keep  EiQ}  fiaite. 

Fig.  S  shows  the  total  delay  T  *  J/X  for  cbc  NS 
server  of  Exaaple  1  with  X  a  13,  23  aad  33.  For  A  * 
13,  T  decreases  aeaoteaically  with  E  gi*iag  E''  >  • 
aad  T  aia  a  16/13  a  1.067.  For  X  a  33  we  tee  that 
T-Mi  at  E-m  siace  X  >  p, .  A  aiaiaua  occurs  at  Eo  a  4 
giviag  T  aia  a  S0.2/33‘a  1.636.  The  X  a  23  curve  it 
the  NS  curve  of  Fig.  6b  divided  by  23. 


Ia  Exaaple  1,  the  route  with  saaller  routiag 
delay  I,  also  had  the  faster  server  (p.  >  Pj)  so  the 
delay  ra  service  usiag  server  I  aad '  the  Tielay  to 
reach  the  destiaatioa  both  favor  route  1.  It  it 
evideat  that  aituatioas  can  ariae  where  these  two 
effects  opppese  each  other  se  that  if  U./P2  lorgc 
eaough  thea  evea  if  Sj  «  S,  route  1  can'stlll  be  the 
preferred  reuie  siace  tie  large  delay  ia  ustag 
server  2  outweighs  the  shorter  delay  ia  reachiag  the 
destiaotioe  aloag  route  2. 

Mere  Pj  a  35,  pj  a  S,  S,  a  1. 1,  Sj  ■  1  tod  X  a 

20.  Fig.  6a  shows  C|Q)  aad  E  for  the  NS  and  S 


4t  OiMfMMf  CMtl  t0f  i39m0t$t 


COST 


■odclt.  It  can  b«  seen  that  tor  the  NS  aotlel  ElCt 
haa  a  aiainiia  as  explained  above.  This  occurs  at  K  = 
4  and  lives  ain  E‘,i)|  =  The  routinf  costs  R 
increase  aouotunicaiiy  vitb  ii  here  because  >  $2- 

Fl|.  6b  shows  that  the  .NS  aodel  has  a  aioinua  at  > 
2  of  J  ain  a  23.114  unite  the  S  aodel  has  a  ainiaua 
of  J  ain  «  22.933  at  K*  >  1. 
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Eaaaple  2  Illustrates  that  the  relative  sites  of 

S.  and  S,  alone  cannot  be  used  to  detenaine  the  pre¬ 
ference  orderiai  of  the  alternate  routes.  An  enact 
deterainatien  can  be  aade  by  finding  C{Qi  *  R  frea 
(29)  with  K  a  1  when  first  ono  route  and  then  the 
other  is  assuaed  to  be  preferred.  The  route  yielding 
the  saaller  cost  will  then  be  the  preferred  route  1. 
An  approxiaate  and  aore  easily  applied  test  can  be 
used.  Suppose  only  one  custoaer  is  in  the  systen. 
If  route  i  it  chosen  then  the  average  delay  to  reach 
the  destination  is  I/m.  *  S.  since  the  first  tem  is 
the  average  delay  in  ierviA  sad  the  second  is  the 
additional  average  delay  encountered  along  the  route. 
Thus  the  aiost  preferred  route  labled  dl  is  the  one 
with  saallest  S.  *  1/m-  and  all  alternate  routes  can 
then  be  ordered  ny  this*' quantity. 

For  Che  values  of  C.xanpi.e  2  we  find  S,  *  l/p,  > 
1.13  and  Sj  *  l/Mj  =  1.2.  ‘ 


Dncentralired  Routins 


The  local  routing  strategy  just  discussed  it 
well  suited  for  use  in  a  decentralised  routing  schene 
for  larger  networks.  Since  each  node  n  is  niaiaizing 
the  average  time  T  that  it  takes  an  eutering  custo- 
■or  to  reach  the  destination,  if  this  infontatioo  it 
exchanged  periodically  betweeii  neighboring  nodes  it 
gives  each  nods  the  infotaation  needed  to  update  its 


S  values  Thus  foi  cxaanle  if  ‘t  the  propaga¬ 
tion  delay  [com  a  nod:  aieng  co'ii.  i  to  neighboring 
node  J  then  a.  a  t.  r  T  will  b:  axed  in  the  local 
routeing  strategy.  *'  Witl/  each  update  a  node  would 
recalculate  the  threshold  values  o.-.  which  its  real 
tine  routing  decision  is  based  using  the  latest 
value  of  S  ,  M.  and  A.  These  information  updates 
would  be  wide  at  a  rate  auch  slower  than  required 
for  tho  local  routing  decisions. 

Conclusion 

This  paper  suggests  an  alternate  route  strategy 
««hich  siaply  compares  the  number  ot  customers  Q  at  a 
node  against  an  optimal  thceshcld  value  K^.  When 
tho  threshold  is  exceeded  tLen  some  traffic  is 
diverted  along  an  alternate  route.  The  real  time 
strategy  is  dynamic  because  it  adapts  its  routing  to 
the  size  of  i).  It  is  also  adaptive  in  the  sense 
that  K*  can  be  recomputed  readily  when  operating 
parameters  such  as  |i.,  A  or  S.  change.  Thus  it 
seerna  ideally  suited^  for  use  In  a  decentralized 
routing  procedure  for  large  networks  where  a  low 
rate  exchange  of  information  between  oeigbhcring 
nodes  can  be  used  to  update  the  operating  thres¬ 
holds. 
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Abstract 

The  optimization  of  two-server  Markovian  queues  with  unequal 
rates  of  service  is  analyzed  via  the  theory  of  Markov  decision  pro¬ 
cesses  using  an  average  expected  cost  criterion.  It  is  shown  that  the 
optimal  policy,  that  minimizes  the  steady-state  average  time  delay 
experienced  by  customers  in  the  system,  is  a  deterministic  threshold. 
Under  this  policy,  the  slower  server  is  used  whenever  Ute  number  of 
waiting  customers  exceeds  the  threshold  value. 
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1.  INTRODUCTION 

Stochastic  optimization  of  queueing  systems  has  been  extensively 
studied  due  to  its  direct  applicability  in  diverse  engineering  and 
economic  disciplines.  Early  works  on  single  server  systems  dealt  with 
optimal  control  of  the  service  rate  in  Markovian  queues  as  in 
[CRABILL  -  72],  [2ACKS  -  70]  and  [BLACKBURN  -  72].  Optimality 
criteria  involved  service  costs/rewards,  service  rate  switching  penal¬ 
ties,  customer  delays  and  start-up/shut-down  costs.  In  [NAOR  -  69] 
the  queue  size  was  controlled  by  imposing  tolls  to  arriving  customers 
and  in  [KNUDSEN  -  72],  this  was  generalized  to  multiple  server 
systems.  In  several  queueing  control  problems  the  optimal  strategies 
depend  on  thresholds  on  the  queue  size.  For  excellent  reviews  on 
the  methodology  involved,  see  [PRABHU  -  73]  and  [STIDHAM  -  82], 
where  specific  hints  are  presented  on  how  to  apply  the  general  semi- 
Markov  decision  theory,  [HOWARD  -  71],  [ROSS  -  69]  to  queueing 
processes . 

A  wealth  of  applications  requiring  optimal  control  of  queues 
emerged  within  the  context  of  store-and-forward  data  communication 
networks.  Switching  nodes  in  the  network  can  be  modeled  as  queueing 
systems  with  various  streams  of  incoming  data  and  several  output 
options.  It  was  found  that  adaptive  routing  at  the  node  level  signifi¬ 
cantly  improves  the  deiay  performance  of  data  messages  as  reported 
in  [BOORSTYN  -  81],  [YUM  -  81).  In  [FOSCHINI  -  78]  a  diffusion 
approximation  was  used  to  analyze  node  models  where  newly  arriving 
messages  had  the  option  to  join  queues  dedicated  to  alternate  routes. 
In  [EPHREMEDES  -  80],  techniques  from  the  optimal  control  of  queues 
were  employed  to  derive  optimal  strategies  for  the  above  node  model. 
In  [SARACHIK  -  82],  the  adaptive  routing  problem  was  formulated  as 


an  M/M/2  queue  with  different  service  rates,  and  based  on  previous 
results,  [SARACHIK  -  81]  pertaining  to  deterministic  input  models,  a 
threshold  policy  was  anticipated  to  minimize  delay.  Note  that  an 
M/M/2  queue  exhibits  lower  delay  than  two  dedicated  queues  since 
arriving  messages  in  the  latter  case  decide  on  a  route  by  estimating  a 
priori  which  server  will  become  available  first.  In  figure  1  we  illus¬ 
trate  an  M/M/2  queue  with  two  unequal  service  rates,  >  Mjj-  If 
the  two  rates  are  not  substantially  asymetric,  immediate  use  of  any 
available  server  on  a  FCFS  basis  appears  to  be  an  efficient  way  to 
empty  the  system.  At  the  other  extreme,  however,  with  server  a 
many  times  faster  than  b,  it  is  reasonable  to  defer  use  of  b  unless 
the  number  of  messages  in  queue  is  large  enough  to  feed  the  fast 
server  for  the  duration  of  service  in  b.  Obviously,  the  "customer" 
in  b  receives  unfair  service  but  the  average  system  pe-formance  im¬ 
proves.  This  is  an  illustration  of  the  discrepancy  between  "indivi¬ 
dual"  and  "social"  optimization  as  reported  in  [KNUDSEN  -  72].  In 
what  follows,  we  formulate  the  M/M/2  optimization  as  a  Markov  decision 
process  and  prove  that  the  optimal  policy  under  certain  conditions  is 
of  the  threshold  type. 

2.  MARKOV  DECISION  PROCESS  FORMULATION 

The  optimization  problem  involves  the  minimization  of  the  average 
number  of  customers  (messages)  in  the  system,  including  those  in 
service.  This  is,  by  Little's  formula,  equivalent  to  minimizing  the 
average  message  delay. 

The  state  must  indicate  the  number  of  customers  in  the  system 
and  denote  whether  each  of  the  servers  a  and  b  is  busy  servicing  a 
customer.  Let  the  state  be 


0 


Empty  system 

ia,  i  >  1  :  i  customers  in  the  system  and  only  server  a  busy 

ib,  i  >  1  :  i  customers  in  the  sytem  and  only  server  b  busy 

i,  i  >  2  i  customers  in  the  system  and  both  a  and  b  busy. 

Actions  may  be  taken  whenever  an  arrival  finds  one  or  both 
servers  empty  and  a  choice  must  be  made  whether  or  not  to  serve  or 
which  server  to  use,  and  whenever  a  departure  creates  the  opportu¬ 
nity  for  a  waiting  customer  to  proceed  to  a  server.  In  order  to  "dis¬ 
cretize"  the  continuous  time  process  defined  on  the  state  space  above, 
we  uniformlze  with  respect  to  R,  the  maximum  total  rate  out  of  all 
states.  Obviously  R  =  X  +  corresponding  to  a  state  i,  with 

both  servers  busy.  For  states  ia,  ib  and  0  we  now  consider  transi¬ 
tions  at  rate  R  some  of  which  represent  arrivals  or  departures  and 
others  represent  "self  loops"  or  transitions  not  triggered  by  external 
events.  As  an  example,  from  state  ia,  i  >  2,  we  may  have  the  follow¬ 
ing  transitions: 


Upon  an  arrival,  with  rate  X 

ia  -► 

(i  +  l)a 

OR 

ia  -► 

(i  +  1) 

Upon  a  departure,  with  rate 

a 

ia  -*• 

(i  -  l)a 

OR 

ia  -*• 

(i  -  1) 

OR 

ia 

(i  -  l)b 

Remaining  transitions  with  rate 

ia 

ia 

OR 

ia  -► 

i 

This  uniformization  technique  if  suggested  in  [STIDHAM,  82]  as  a 
means  to  simplify  the  algebra  involved  with  the  optimaiity  equations 
without  changing  the  steady  state  statistics  of  the  process. 
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The  possible  transitions  out  of  a  state  depend  on  the  actions 
taken  at  transition  points,  whenever  there  is  a  choice.  The  action 
space  is  finite;  either  serve  the  customer  at  the  head  of  the  queue 
with  a  specific  available  server  or  n  t.  No  swapping  is  allowed  i.e.  a 
message  already  in  server  a  may  not  be  switched  to  server 
b.  Furthermore,  we  assume  that  if  there  are  customers  in  the  system, 
at  least  one  server  must  be  active.  This  assumption  may  be  proved 
but  we  omit  the  formalism  to  provide  a  clear  picture  of  the  state 
transitions.  In  Figure  2  we  depict  the  state  diagram  with  all  candidate 
transitions.  Solid  lines  and  circles  identify  a  set  of  actions  leading  to 
a  threshold  policy  at  i  =  3.  Under  that  policy  all  dotted  states  are 
transient  and  do  not  influence  the  steady  state  averages.  Rates  such 
as  p”  denote  transitions  opposing  to  the  threshold  policy  but  valid 
candidates  for  an  optimal  policy. 

The  optimal  policy  n,  would  minimize  the  average  expected  cost  g 
uefined  as  the  average  number  of  customers  j  under  n: 

g  =  min  {E^(j)} 

71 

If  C(Xj^)  denotes  the  cost  of  state  at  the  n^  period  between  the 
n^  and  the  (n  +  l)st  transition  (with  average  dwell  time  1/R),  then 

n-0 

The  one  period  cost  C(X^)  is  exactly  the  number  in  system  at 
C(i)  =  C(ia)  =  C(ib)  =  i 

In  order  to  use  the  results  of  the  Markov  Decision  theory  direct¬ 
ly,  we  first  limit  the  queue  capacity  (maximum  number  of  customers) 
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to  a  large  but  finite  number  M.  Arrivals  into  a  full  system  are 
blocked  and  cleared.  The  minimization  criterion  involves  customers 
already  in  the  system  and  ignores  the  effect  of  blocking.  This  trun¬ 
cation  is  a  standard  technique,  [PRABHU  -  73]  to  alleviate  problems 
induced  by  unbounded  one  period  costs  and  to  handle  ergodicity 
requirements  for  average  expected  cost  minimization.  Certainly  if  M 
is  large  enough  and  the  process  stable,  truncation  is  a  good  approx¬ 
imation.  Furthermore,  it  will  be  shown  that  the  form  of  the  optimal 
policy  does  not  depend  on  M  and  thus  we  may  postulate  that  if  a 
stable  optimal  policy  exists  it  is  of  the  same  form  as  in  the  truncated 
problem. 

Under  the  modeling  assumptions  stated  above,  all  conditions  for 
the  existence  of  a  deterministic  stationary  optimal  policy  are  satisfied, 
[ROSS  -  69,  p.  149]  and  thus  the  policy  n  minimizing  the  average 
number  of  customers  in  the  system  is  a  deterministic  assignment  of 
actions  to  states. 

3.  FORM  OF  THE  OPTIMAL  POLICY 

In  order  to  determine  the  form  of  the  optimal  policy,  we  examine 
the  equivalent  discounted  cost  problem  with  a  discount  factor  a  <  1. 
As  shown  in  [ROSS  -  69]  this  problem  can  be  transformed  into  an 
average  expected  cost  minimization  by  letting  a  -»  1. 

Let  V(i)  V(ia),  V(ib)  denote  the  a-optimal  costs  starting  from 
states  i,  ia,  and  ib  respectively.  With  representing  the  state 
during  the  n^  period.  We  have 

Q» 

V(X.)  =  min  E„{[  1  a*'  C(X„)]/X.} 

^  7t  "  n=0  n  u 

Xq,X^  e{i,ia,ib},  C(i)  =  C(ia)  =  C(ib)  =  i 


■0 


0 


The  recursive  optimality  equations  for  the  Vs  are  given  below: 

V(0)=a  [\-inin{V(la),  V(lb)l  +  (M3+Mb)V(0)]  (1) 

(X+Ma+Mb)  V(la)=l+a[\-min{V(2a),V(2)}  ♦  V(0)+MjjV(la)]  (2) 

+  pjj  V(0)+MaV(lb)]  (3) 

(X+M3+Mb)V(2a)=2+o[X*min{V(3a),V(3)}+  M^min{V(la),V(lb)}  + 

Mjjmin{V(2a).V(2)}]  (4) 

(X+Ma+Mb)V(2b)*2+a[X-min{V(3b),V(3)}-*-M^min{V(la),V(lb)}  + 

M3min{V(2b),V(2)}l  (5) 

(X+P3+Mb)V(2)=2+<y[X*V(3)+Ma  V(lb)+Mjj  V(la)]  (6) 

(X+p^+Mjj)V(i,a)=i+a[X*min{V(i+l,a),V(i+l)}+p^min{V(i-l,a),V(i-l,b),V(i-l)} 

+Mjjmin{V(i,a),V(i)}],  i>2  (7) 

(X+Ma+Mb)V(i.b)=i+a[X-nun{V(i+l,b),V(i+l)}+pjjmin{V(i-l,b),V(i-l,a),V(i-l)} 

+Mamin{V(l,b),V(i)}],  i>2  (8) 

(X+Ma+Mb)V(i)=i+or[X-V(i+l)+Mb"^tV(i-l),V(i-l,a)} 


+Pamin{V(i-l),V(i-l,b)}], 


i>2  (9) 


If  M  is  the  maximum  (blocking)  state 
(X+Ma-*-Mb)V(M.a)=M+a(X-V(M,a)+Pamin{V(M-l,a),V(M-l,b),V(M-l)} 

+Mjjmin{V(M,a),V(M)}]  ( 

(X+Ma+Mb)V(M'^)=M+a(X-V(M,b)+Mbnun{V(M-l,b).V(M-l,a),V(M-l)} 

+Mamin{V(M,b),V(M)}]  ( 

(X+Ma+Mb)^(M)=M+a[X-V(M)+Mbmin{V(M-l).V(M-l,a)}+ 

+p,min{V(M-l),V(M-l,b)}  ( 

a 

Let  R  denote  the  unformization  rate; 


R  =  X+Pa+Mb 


LEMMA  1 


I  The  functions  V(i),  V(ia),  V(ib)  are  increasing  in  i  and 

V(la)>  V(0).  V(lb)>V(0),  V(2)>V(la),  V(2)>V(lb) 

Proof 

I  We  will  use  induction  on  the  recursive  equations  (l)-(9).  First 

assume  that  we  start  with  a  set  of  Vs  satisfying  the  Lemma.  A 
logical  choice  is  to  set  the  cost  at  step  0  equal  to  the  one  period 
I  costs : 

Suppose  now  that  the  lemma  holds  after  n  subsequent  applications  of 
I  the  recursive  equations  to  V^: 

Vj^(i),  sre  increasing  on  i 

To  complete  the  proof  we  must  show  that  the  lemma  will  hold  at  the 
I  (n+l)st  step.  We  will  demonstrate  this  step  for 

(9)  we  have: 

RVn^l(i+l)=i+l+a[AVn(i+2)+Mbmin{Vn(i),Vn(i,a)l+ 

I  +M3min{V^(i),V^(i,b)}] 

RVn+i(i)=i+or[XVn(i+l)+Mbn“n{Vn(i-l),Vn(i-l,a)+ 

+Mgmin{Vjj(i-l),V^(i-l,b)}] 

I  Now,  i+l>i,  Vj^(i+2)>V^(i+l)  (by  assumption),  min  {V^(i),V^(i,a)}  > 

min{V  (i-l),V  (i-l,a)}  and  min{V  (i),V  (i,b)}>min{V  (i-l),V  (i-l,b)} 
n  n  n  n  ai  ii 

since  the  minimum  of  two  increasing  functions  is  increasing.  Thus 
RV^^j^(i+l)  >  Using  the  same  reasoning  it  can  be  easily 

seen  that  all  costs  increasing  on  i  up 

to  its  maximum  M  and  that  ^  ^  ^ 

i 

I 


^  n 


VM  i 


J 
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LEMMA  2 

If  V(la)<V(lb),  and  V(i)<V(ib),  i>2 

Proof 

We  use  the  same  inductive  argument  as  in  the  proof  of  Lemma  1. 
Assuming  that  the  inequalities  hold  at  step  0  and  step  n,  we  have  at 
the  (n+l)st  step 

RV^^.l(la)  =  1  +  of[\-min{V^(2a),V^(2)]  +  MaV^(O)  +  M^V^da)} 
RV^^ldb)  =  1  +  a[\V^(2)  + 

In  order  to  show  that  since  Vj^(2)  >  min  {V^(2a), 

V^(2)},  it  suffices  to  show  that 

^  MbV„(0)  ♦  M^V^db)  or 

0^V„(lb)  -  Mi,V„(la)  >  -  Mb)V„(0) 

Since  V^(lb)  >  Vj^(la)  (by  inductive  assumption),  and 

Vj^(la)>Vj^(0)  (Lemma  1),  it  follows  that 

‘'aV„<l‘»  -  Mb''n<l®>"<‘'a  ’  Mb>V„(l^)><M,  -  Mb)V„(0) 

This  proves  the  first  argument,  V(la)<V(lb),  assuming  Vj^+^CO 
V^^j^(ib).  From  (5)  and  (6),  applying  the  inequalities  at  step  n,  we 
have 

RV„^.i(2)  =  2  +  a[XV^(3)  +  MaV^db)  + 

^Vl^2b)  *  2  +  of[AV^(3)  +  Mb'^jj(la)  +  MaV^(2)] 

From  Lemma  1,  we  have  V^(2)  >  V^(lb),  thus  V^^j^(2)  <  Vj^+i(2b) 
From  (8)  and  (9),  we  have  that  for  i>2 
RV^^l(i)=i+a[\Vn(i+l)+Mbmin{Vn(i-l),Vn(i-l,a)}+MaVn(i-l)] 
RVj^^l(ib)=i+of[\Vn(i+l)+Mbtnin{Vn(i-l,a),Vn(i-l)]+MaVn(i)] 
and  since  Vj^(i)>V^(i-l)  it  follows  that  Vj^+i(i)<V^^j^(ib). 


Note  that  we  make  use  of  arguments  from  Lemma  1  in  the  induc¬ 
tive  proof.  This  is  valid  since  the  two  Lemmas  are  consistent  and 
may  start  from  the  same  initial  set  of  Vs.  Alternatively,  one  can 


combine  the  two  Lemmas  into  a  single  inductive  proof. 

COROLLARY  1 

States  ib,  i>l  are  transient  under  the  optimal  policy. 

Proof 

Observing  the  optimality  equations,  we  notice  that  under  Lemma 
2,  there  is  no  path  from  states  (i)  and  (ia)  into  any  state  (ib),  i>l. 
State  (lb)  may  be  reached  upon  departure  from  2.  This  statement 
conforms  with  the  intuitive  argument  that  there  is  no  reason  to  use 
the  slow  server  when  the  fast  server  is  idle. 

LEMIvLA  3 

If  V(i-l,a)>V(i-l),  i-l>l,  then  V(j,a)>V(j)  for  all  j  >  i 

Proof 

Assume  that  in  the  optimal  policy,  there  exists  a  state  i-1,  such 
as  V(i-l,a)  >  V(i-l).  In  this  case,  we  will  use  the  inductive  procedure 
again  but  now  keeping  the  costs  of  states  (i-1, a)  and  (i-1)  fixed  to 
their  optimal  values.  The  iterative  application  of  the  optimality  equa¬ 
tions  to  the  remaining  costs,  will  converge  by  contraction  to  their 
optimal  values.  The  a-priori  knowledge  of  V(i-1,';  and  V(i-l)  will 
reduce  the  number  of  unknowns,  and  if  our  guess  is  correct,  it 
just  creates  two  redundant  consistent  equations. 

At  step  0  we  set  VQ(i-l,a)=V(i-l,a)  and  VQ(i-l)=V(i-l).  The 
remaining  V^'s  are  set  to  arbitrary  initial  values  satisfying  the  Lemma 


and  the  monotonicity  property  of  Lemma  1,  in  reference  to  V(i-l,a) 
and  V(i-l).  It  can  be  easily  seen  that  the  monotonicity  property  will 
hold  throughout  the  subsequent  operations,  by  translating  the  proof 
of  Lemma  1,  to  incorporate  the  fixed  values  V(i-l,a)  and  V(i-l). 

Assume  that  at  step  n  the  Lemma  holds.  Remember  that  no 
change  is  allowed  for  the  cost  of  states  (i-2,a)  and  (i-1).  From  (7) 
and  (9),  we  have 

RV^^^(i.a)=i+a[XV^(i+l)+M^V^(i-l)+MjjVj^(i)] ,  V^(i-l)=V(i-l) 

RVn+l(i)=i+oi(\Vn(i+l)+MbVn(i-l)+Ma^n^^‘^^^ 

and  since  Vj^(i)>V^(i-l),  it  follows  that 

equations  apply  for  all  j  >  i.  At  j=M,  V^(j+1)  is  replaced  by  V^^CM) 
and  similar  inequalities  hold.  This  completes  the  proof  of  the  Lemma. 

THEOREM  la 

The  a-optimal  policy  for  the  finite  M/M/2  queue,  defers  service 
from  the  slow  server  until  the  number  of  waiting  customers  exceeds  a 
threshold  value. 

Proof 

From  Lemma  3,  it  follows  that  for  the  least  K  >  1  for  which 
V(K+l,a)>V(K+l),  all  states  (ia),  i>K,  will  exhibit  greater  costs  V(ia) 
than  V(i).  Thus  new  arrivals  and  departures  will  favor  states  (ia), 
for  i  <K  and  states  (i)  for  i>K.  This  threshold  K  will  render  all 
states  (ia),  i>K,  transient  along  with  states  (ib),  i>l  (Corollary  1). 
The  threshold  value,  depends  on  the  parameters  It  may  be 

that  for  the  threshold  K  approaches  the  limit  of  the  queue 

size  in  which  case,  we  never  use  the  slow  server  b.  Equivalently  if 
Ma/Pb  is  close  to  unity,  the  threshold  moves  to  1  which  is  exactly  an 
M/M/2  queue  with  arrivals  on  empty  state  routed  to  server  a. 
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THEOREM  lb 

The  optimal  average  expected  cost  policy  for  the  finite  M/M/2 
queue  is  a  deterministic  threshold  as  in  Theorem  la. 

Proof 

For  the  finite  state  problem,  state  (0)  is  recurrent  for  all  deter¬ 
ministic  policies  considered  in  the  a-optimal  recursive  equations  (1)- 
(12).  Furthermore  the  one  period  cost  i  is  bounded  by  its  maximum 
M  and  hence  all  conditions  for  the  existence  of  optimal  average  expected 
cost  policies  are  satisfied  as  in  [ROSS  -  70],  Corollary  6.20.  The 
structure  of  the  optimal  policy  7t  will  be  the  same  as  in  the  discounted 
case  and  the  optimal  average  expected  cost,  g=E^(i)  is  given  by 

g  =  lim  (l-a)V  (0) 

The  truncation  of  the  state  space  was  introduced  to  guarantee 
that  the  one  period  costs  are  bounded,  and  that  state  0  is  recurrent 
under  all  policies.  However,  since  the  threshold  structure  does  not 
depend  on  the  maximum  queue  size  M,  it  can  be  concluded  that  if  a 
stable  optimal  policy  exists  to  the  infinite  state  problem,  it  is  of  the 
threshold  type. 

4.  DISCUSSION  AND  NUMERICAL  EXAMPLES 

In  the  previous  section  we  proved  that  the  optimal  policy  mini¬ 
mizing  the  average  queue  length  (or  average  delay)  is  of  the  thresh¬ 
old  type.  Obviously  the  criterion  involves  "social  optimality"  from  the 
system  point  of  view.  Individual  customers  (messages)  would  never 
chose  the  slow  server  regardless  of  the  state  of  the  system  unless 
their  service  time  on  the  slow  server  is  less  than  the  sum  of  their 


service  time  in  the  fast  server  and  the  waiting  time  before  being 
served  by  the  fast  server.  The  optimiun  threshold  value  depends  on 
the  parameters  of  the  system,  namely  the  ratio  of  the  service  rates 
and  the  total  system  utilization.  In  order  to  illustrate  this  dependence 
we  solved  the  equilibrium  equations  in  an  infinite  size  system  for 
various  thresholds,  service  rate  ratios  and  traffic  intensities.  In 
Figure  3,  we  depict  the  average  queue  length  at  traffic  intensity 
V(Mg+|Jb)=0-5-  As  expected  if  server  a  is  much  faster  than  b  (39  to 
1),  the  optimal  threshold  becomes  very  large,  eventually  leading  to  a 
single  server  queue  with  the  slow  server  never  used.  Conversely  for 
relatively  balanced  rates  (2  to  1  or  equal),  both  servers  are  always 
used  as  in  an  uncontrolled  M/M/2  system  (preference  is  given  to  a  if 
both  a  and  b  are  idle).  For  rate  ratios  3  to  1  and  7  to  1  there 
exists  clearly  an  optimal  threshold  at  2  and  4  respectively.  A  con¬ 
sequence  of  using  thresholds  is  that  we  obtain  ”.nbalanced  traffic 
volumes  routed  via  a  and  b.  In  Figure  4,  we  illustrate  the  percentage 
of  the  traffic  routed  via  a,  100  x  \  /\.  Notice,  that  even  with  bal- 

a 

anced  rates  1  to  1  and  K=1  (pure  M/M/2  system),  more  traffic  is 
routed  to  a  than  b,  due  to  decisions  taken  upon  arrivals  to  an  empty 
system.  As  the  threshold  increases,  certainly  the  \  /\  ratio  ap- 

a 

proaches  100%.  This  observation  led  to  the  suggestion  of  threshold 
policies  in  adaptive  routing  schemes,  where  the  traffic  served  via 
each  outgoing  link  (server)  is  constrained  to  a  value  determined  by 
some  global  criterion.  In  [MAGLARIS  -  83],  we  formulated  this 
problem  and  proved  that  threshold  policies  minimize  average  queueing 
delays  with  constrained  output  flows. 
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Finally,  in  Figure  5,  we  demonstrate  the  effect  of  the  traffic 
intensity  on  the  optimum  threshold  value.  As  intuitively  expected, 
queues  close  to  saturation  do  not  have  the  luxury  of  threshold  poli¬ 
cies  and  must  operate  both  servers  at  full  capacity.  Lightly  utilized 
queues  do,  however,  exhibit  improved  performance  under  threshold 
values  which  increase  as  the  utilization  drops.  Depending  on  the 
Pa  .’Mb  ratio,  a  system  utilized  at  low  levels  would  involve  thresholds 
approaching  a  positive  integer  close  to  “1-  As  an  example, 

for  *  7,  and  very  low  traffic  intensity,  the  optimal  policy  uses 

the  slow  sei:ver  only  if  7  messages  have  been  accumulated,  i.e.  the 
threshold  value  is  6. 


.1 
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ABSTRACT 

We  consider  several  versions  of  the  Knapsack  Problem  and  show 
that  they  can  be  solved  using  a  shortest  path  algorithm  requiring  both 
storage  and  running  time  which  are  polynomial  functions  only  of  the 
number  of  types  of  object  and  the  size  (weight)  of  a  single  object. 
This  is  a  significant  improvement  over  previous  algorithms  for  the 
solution  of  the  Knapsack  Problem  using  shortest  paths,  which  typically 
require  storage  and  runtime  which  are  functions  of  the  total  knapsack 
size. 

I.  INTRODUCTION 

The  Knapsack  Problem,  which  is  interesting  in  its  own  right,  has 
also  received  much  attention  recently  as  a  means  of  solving  integer 
programming  problems  via  a  relaxation  technique  using  group  theory. 
Formally,  the  Knapsack  Problem  can  be  stated  as: 

M 

Maximize  z  =  1  p-x. 

i=l  ‘  ^ 

♦This  work  was  partially  supported  by  NSF  under  Grant  ENG  7908120 
and  by  U.S.  Army  CECOM  under  Contract  DAAK-80-80-K-0579. 


-47- 


M 

subject  to  =  Z  w.x.  =  W 
i=l  ^  ^ 

X.  >0  for  i  =  1,2, ..  .M 
X.  integer 

Thus,  we  are  given  M  types  of  object  with  weight  w.  and  profitability 
p.  per  object  of  type  ..  We  are  required  to  maximize  the  total  profit 
subject  to  a  constraint  that  the  sum  of  the  weights  of  the  objects 
selected  equals  W.  In  the  simple  (unbounded)  version  of  the  problem 
there  is  no  restriction  on  the  number  of  objects  of  each  type  which 
may  be  included  in  the  solution. 

Sometimes  an  inequality  constraint  is  used  in  place  of  the  equal* 
ity  constraint,  i.e.,  one  seeks  objects  whose  aggregate  weight  does 
not  exceed  W.  Such  a  problem  can  be  transformed  into  a  problem 
with  an  inequality  constraint  by  including  an  additional  object  with 
unit  weight  and  zero  profitability.  Here  we  will  concentrate  primarily 
on  problems  with  equality  constraints. 

We  assume  that  W  and  the  w^  are  non-negative  integers  (or, 
more  generally,  that  they  are  commensurate)  and  that  the  Pj  are 
non-negative . 

Previous  approaches  to  the  solution  of  this  problem  include  those 
reported  by  Horowitz  and  Sahni^^^  using  dynamic  programming  and 
implicit  enumeration.  These  approaches  have  been  found  to  be  effec¬ 
tive  for  a  wide  range  of  problems  but  have  exhibited  excessive  run¬ 
times  for  problems  with  a  large  number  of  nearly  equally  valuable  (in 
terms  of  cost  per  unit  weight)  objects.  Also,  their  worst  case  run¬ 
ning  times  are  exponential  in  the  number  of  object  types.  Denardo 


and  Fox^  ^  have  developed  an  approach  which  overcomes  these  ob¬ 
jections  by  solving  the  problem  as  a  shortest  path  problem  in  a  net¬ 
work  with  W  nodes  and  W  x  M  arcs.  Thus,  their  approach  requires 
storage  proportional  to  W  and  runtime  proportional  to  W  x  m.  If  W 
and  M  are  sufficiently  small,  their  approach  is  extremely  attractive. 
If  W  is  large,  however,  the  storage,  and  to  a  lesser  extent  the  run¬ 
time,  become  prohibitive. 

Denardo  and  Fox  present  their  algorithm  in  the  context  of  using 
a  Knapsack  Problem  as  an  integer  programming  relaxation.  In  such 
cases,  W  is  generally  of  the  same  order  as  the  w..  We  extend  their 
approach  to  the  case  where  W  is. very  large  but  the  individual  w.  are 
not,  and  exploit  the  underlying  cyclic  group  structure  of  the  problem 
to  develop  an  algorithm  with  both  runtime  and  storage  requirements  a 
function  •  only  of  the  weight  of  a  single  object  and  the  number  of 
object  types.  Garfinkle  and  Nemhauser^"^^  point  out  the  relationship 
to  the  group  structure  but  do  not  discuss  application  of  the  structure 
to  develop  an  algorithm  as  efficient  as  the  one  presented  here. 

II.  BASIC  PROCEDURE 

For  the  sake  of  clarity,  we  begin  by  describing  the  basic  proce¬ 
dure  and  justifying  it.  In  later  sections,  extensions  and  refinements 
of  this  procedure  are  considered. 

We  begin  by  defin*  g  a  network  K  =  (N,A,L)  corresponding  to 
the  Knapsack  Problem  (KP).  The  node  set,  N,  contains  W  +  1  nodes 
numbered  0  through  W  corresponding  to  the  aggregate  weight  of  the 
objects  selected  thus  far.  At  each  node,  i,  there  are  outgoing  arcs 


corresponding  to  feasible  objects  which  may  be  selected  to  augment  a 
partial  solution  with  weight  i-  Thus,  at  node  i  there  will  be  arcs 
(i,  i  +  Wj^)  for  all  K  such  that  i  +  Wj^  <  W.  The  length  of  an  arc 
corresponding  to  an  object  of  type  K  is  pj^.  We  thus  have: 

N  =  {0.1,...  Wj 

A  =  {(i,  i  +  Wj^)!  i  =  0,1,... W,  for  all  k  such  that  i  +  Wj^  <  W} 

L  +  {L  !  (i,j)eA,  where  L.  =  p^  for  arcs  corresponding  to  type 

IJ  ij  K. 

k  objects} 


The  network  thus  contains  approximately  MW  arcs.  We  will  assume 
that  w,  ^  w.  for  ki  ^  ka  as  if  two  types  of  objects  have  the  same 
weight  the  one  with  the  smaller  profit  can  be  ignored.  We  also  assume 
Wj^  >  0  and  i  0  for  all  k.  We  will  refer  to  arcs  corresponding  to 
a  type  k  object  simply  as  type  k  arcs.  Figure  1  illustrates  part  of  a 
network  corresponding  to  a  KP  with  4  types  of  object,  described  in 
the  table  accompanying  Figure  1.  For  the  sake  of  clarity,  only  nodes 
0  through  7  and  arcs  from  nodes  0  and  1  are  shown  in  Figure  1. 

Paths  from  node  0  to  node  W  in  such  networks  correspond  to 
solutions  of  the  associated  KP.  The  length  of  a  path,  defined  as  the 
sum  of  the  lengths  of  the  arcs  in  the  path,  is  the  total  profit  of  the 
solution.  Thus,  the  longest  path  corresponds  to  the  optimal  solu¬ 
tions.  Note  that,  by  the  assumption  that  Wj^  >  0  for  all  k,  the  net¬ 
work  is  acyclic  and  so  the  problem  of  finding  the  longest  path  from  0 
to  W  is  solvable  using  Bellman's  shortest  path  algorithm  suitably  modi¬ 
fied  to  find  longest  paths.  Defining  dj  to  be  the  length  of  the  cur¬ 
rent  estimate  of  the  longest  p _ _  fr(  node  0  to  node  i,  this  amounts 

simply  to: 

that  w^  f  w.  for  Ki  f  K2  as 

Ki  K2 


I 


Algorithm  1 

Step  0:  do  ^  0 

d.  ^  »  i  =  1, _ . 

Step  1:  For  i  =  1  to  W 

For  all  j  such  that  (i,j)eA 
{d.  =  max  (d.,d.  +  Ijj) 

One  can  also  keep  track  of  the  solution  itself  by  setting  aj  i  when¬ 
ever  the  value  of  dj  is  updated.  At  the  end  of  the  algorithm  one  can 
trace  the  path  back  from  node  W  to  node  0  using  the  aj.  We  refer  to 
dj  as  the  label  on  node  j  and  to  aj  as  the  predecessor  of  node  j. 
Step  1  is  known  as  a  scan  of  each  node,  1. 

Algorithm  1  is  the  simplest  possible  shortest  path  algorithm. 
The  nodes  are  scanned  in  numerical  order  and  each  node  is  scanned 
only  once.  The  algorithm  has  a  running  time  on  the  order  of  MW 
operations  and  a  storage  requirement  on  the  order  of  W.  If  W  is 
very  large,  the  approach  becomes  unattractive.  The  approach  des¬ 
cribed  below  avoids  this  problem  by  solving  the  problem  using  a 
network  which  is,  in  general,  much  smaller.  The  key  to  reducing  the 
size  of  the  network  is  embodied  in  the  following  lemmas.  Define 

D*  P 

W  =  max  (w.  )  and  w*  =  w.  such  that  ^  =  max  k.  Thus  w  is  the 
k  ^  ^  k  Wj^~ 

weight  of  the  "heaviest"  type  of  object  and  w*  is  the  weight  of  the 
"best"  type  of  object. 

The  proofs  of  some  of  the  lemmas  rely  upon  elementary  proper¬ 
ties  of  cyclic  groups.  The  interested  reader  is  referred  to  [6]  for 
more  details. 
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Lemma  1:  An  optimal  KP  solution  exists  containing  no  set  of 
objects  whose  aggregate  weight  is  a  multiple  of  w*. 

Proof:  Given  an  optimal  KP  solution  containing  a  set  of  objects 
of  aggregate  weight  nw*,  we  can  replace  these  objects  by  n  "best” 
objects  without  altering  the  total  weight.  The  net  change  is  profit, 
Ap  is  defined  by: 

Ap  =  np*  -  I  p. 

j  ^ 

f  (-^  P*  -  Pj) 

By  the  definition  of  p*,  each  term  of  the  sum  is  non-negative 
and  hence  Ap  >  0.  We  can  thus  replace  the  set  without  destroying 
optimality  and  the  lemma  is  proven. 

The  terms  in  the  above  sum  can  be  interpreted  as  the  amount  of 

profit  "wasted"  by  using  a  type  j  object  instead  of  a  best  object.  We 

can  in  fact  redefine  the  arc  lengths  in  the  network  corresponding  to 

a  given  KP  to  be  precisely  these  quantities.  The  optimal  KP  solution 

then  corresponds  to  the  shortest  path  from  0  to  W.  This  can  be  seen 

to  be  true  by  observing  that  the  above  transformation  of  the  arc 

lengths  affects  the  lengths  of  all  paths  from  0  to  W  in  precisely  the 

same  way.  If  the  length  of  a  0  to  W  path  in  the  original  network  was 

w 

p  the  length  of  the  same  path  in  the  new  network  is  P*  -  P 
Thus  the  relative  length  of  all  0  to  W  paths  are  reversed.  Alterna¬ 
tively,  one  can  observe  that  the  optimal  path  is  the  one  which  wastes 
the  least  profit.  In  the  sequel,  we  use  this  form  of  the  length  func¬ 
tion  as  it  will  allow  us  to  improve  the  efficiency  of  the  algorithm.  We 
refer  to  the  network  using  this  length  function  as  Kj. 


In  Figure  1,  the  new  lengths  of  arcs  of  types  1  through  4  are 
1,2,0,  and  2,  respectively.  Note  that  type  3  arcs  are  best  and  that 
best  arcs  have  0  length.  All  arcs  have  non-negative  length.  If  the 
best  type  of  arc  is  not  unique  (e.g.  if  W2  =  9  in  the  above  example), 
it  is  possible  for  the  length  of  more  than  one  type  of  arc  to  be  0. 
To  avoid  confusion,  we  assume  that  the  best  type  of  object  is  unique. 
The  extension  of  the  algorithms  to  this  case  is  straight  forward. 

Lemma  2:  An  optimal  solution  exists  containing  no  more  than 
w*  -  1  non-best  objects. 

Proof:  Any  set  of  w*  or  more  objects  must  contain  a  subset  of 
weight  nw*  for  some  n  >  0.  To  see  this,  consider  a  set  of  w*  ob¬ 
jects,  ii,i2. •  • From  the  sums 
i 

S.  =  (  2  W,  )  modulo  W* 

J  .  k=l  ‘  k 

i 

If  S.  =  0  then  Z  W.  =  nw*. 

J  k=l  ^  k 

JL 

If  S.  =  S-  for  £  >  j  then  Z  W.  =  nw*. 

^  ^  k=j  ^  k 

But  there  are  w*  Sj's  and  only  w*  -  1  nonzero  values  of  modulo 

w*.  Hence,  either  S.  =  0  or  S.  =  S.  for  some  £  and  the  set  contains 

J  J  ^ 

a  subset  of  weight  nw*.  From  the  proof  of  Lemma  1  we  see  that  we 
can  replace  any  such  subset  by  n  best  elements  without  decreasing 
the  profit.  Thus,  we  need  only  consider  sets  containing  w*  -  1  or 
fewer  non-best  elements  together  with  zero  or  more  best  elements . 
An  optimal  solution  can  always  be  obtained  in  this  way. 


But  since  all  elements  have  weight  no  greater  than  w,  the  ag¬ 
gregate  weight  of  the  non -best  elements  in  any  such  optimal  solution 


will  not  exceed  W*(w*  -  1).  If  W  >  Ww*,  we  need  not  consider  W 
explicitly  as  a  constraint.  We  need  simply  apply  the  above  algorithm 
to  get  dj  for  j  <  ww*,  and  then  complete  the  solution  by  adding  the 
appropriate  number  of  best  objects. 

Indeed,  we  are  only  interested  in  the  weight  modulo  w*  of  a 
partial  solution  since  any  two  solutions  whose  weights  differ  by  a 
multiple  of  w*  differ  only  by  one  or  more  best  objects.  We  thus 
have: 

Lemma  3:  If  W  >  ww*,  the  optimal  KP  solution  can  be  found  by 
adding  zero  or  more  best  objects  to  the  set  of  objects  defining  the 
shortest  path  in  the  network  formed  by  merging  all  nodes  with  the 
same  weight  modulo  w*  as  described  above. 

Proof:  Consider  the  original  network  before  the  nodes  were 

merged.  Consider  the  0  to  W  path  corresponding  to  an  optimal  solu¬ 
tion.  By  Lemma  1,  we  may  assume  this  path  contains  no  set  of  arcs 
(correspond  to  objects)  of  weight  nw*  except  for  best  arcs.  Since 
the  network  contains  paths  corresponding  to  picking  these  objects  in 
all  possible  orders,  we  may  assume  that  the  path  under  consideration 

has  all  the  best  arcs  at  the  end.  Consider  only  the  initial  subpath 

f- 

corresponding  to  the  non-best  arcs.  f 

Now,  consider  what  the  transformation  does  to  this  initial  sub¬ 
path.  All  the  nodes  in  this  path  have  different  weights  modulo  w*. 

Thus  the  transformation  changes  this  optimal  0  to  W  path  in  Ki  to  a  0  « 

to  W'  path  in  the  network  with  merged  nodes,  where  W'  =  W  modulo 
w*.  Note  that  other  paths,  including  some  optimal  ones,  become 
complex  paths  containing  cycles  because  they  contain  subsets  whose  1 

aggregate  weights  are  a  multiple  of  w*.  The  path  we  are  focusing  on 
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however  remains  a  simple  path  and  can  thus  be  found  using  an  ordi¬ 
nary  shortest  path  algorithm. 

No  new  paths  are  created  by  the  transformation.  Thus,  an 
optimal  KP  solution  can  be  obtained  by  appending  0  or  more  best  arcs 
to  the  shortest  0  to  W'  path  in  the  new  network,  and  the  lemma  is 
proven . 

It  is  thus  possible  to  define  a  new  network 
Ka  =  (N.A,L): 

N  =  {0,l,...w*-l} 

A  =  {(i,j)  I  j  -  (i  +  Wj^)  modulo  w*  for  some  K} 

w 

L  =  {Aj.  |(ij)£  A,  A..  =  —  P*  -  Pk  for 
k  defining  (i,j)} 

This  network  has  only  w*  nodes  instead  of  W.  Furtliermore ,  we 

note  that  if  (W.  =  W,  )  modulo  w*,  then  type  ki  and  type  kg  arcs 
Ki  k.2 

are  parallel  and  the  arc  corresponding  to  the  object  with  the  smaller 
A.j  can  be  ignored.  Thus,  there  is  at  most  1  outgoing  arc  from  each 
node  to  each  other.  The  total  number  of  arcs  is  thus  at  most 
w*(w*  -  1).  We  have  thus  removed  dependency  on  both  M  and  W  and 
have  a  procedure  whose  running  time  and  storage  are  a  polynomials 
only  in  w*. 
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ALGORITHM  DESCRIPTION 


We  now  describe  the  actual  procedure  in  more  detail  and  analyze 
its  storage  and  runtime. 
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The  following  is  the  essence  of  the  procedure: 
Step  0:  (Initialization) 


do  =  o 

dj  =  »  j  =  1, . . .  w*-l 

where  dj  is  the  current  estimate  of  the  length  of  the  shortest  path 
from  0  to  j. 

Step  1:  Find  i,  the  best  node  to  scan  next.  This  is  discussed  in 
detail  below. 

Step  2:  (Scan  node  i) 

For  k  =  2,  . . .  M 

Let  j  =  (i  +  Wj^)  modulo  w* 

If  dj  >  d.  +  A.j  then  dj  =  dj  +  £.j  and  PRj  =  k 
where  il^j  is  the  length  of  an  arc  corresponding  to  a  type  k  object 
(iijj  =  Wj^(p*/w*  -  pj^)  as  defined  above)  and  PRj  records  the  type  of 
the  arc  used  to  label  node  j  and  hence  keep  track  of  the  optimal 
solution. 

Step  3:  Return  to  Step  1  if  any  nodes  remain  to  be  scanned;  other¬ 
wise  stop. 

It  is  clear  that  the  storage  required  for  this  procedure  is  linear 
in  w*  +  M.  If  Step  1  is  sufficiently  simple  and  each  node  is  only 
scanned  once,  the  procedure's  runtime  will  be  dominated  by  Step  2 
and  will  be  proportional  to  Mw*.  This  is  in  fact  the  case,  as  we  will 
see. 


A  node,  j,  must  be  scanned  if  its  label,  dj,  is  improved  (re¬ 
duced)  as  it  may  then  improve  the  labels  of  other  nodes.  If  we  are 
to  ensure  that  each  node  is  scanned  at  most  once,  we  must  defer 
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scanning  it  until  its  label  cannot  be  improved.  When  all  arc  lengths, 
are  positive,  as  they  are  here,  this  objective  is  achieved  by  scanning 
the  nodes  in  ascending  order  of  d. .  This  is  Dijkstra's  well  known 
shortest  path  algorithm. 

In  the  general  case,  Dijkstra's  algorithm  requires  that  in  Step  1 
we  find  the  as  yet  unscanned  node  with  the  smallest  label.  If  this 
were  done  naively ,  it  would  require  an  examination  of  w*  labels  and 
the  overall  procedure  would  require  (w*)^  +  Mw*  operations.  In  this 
case,  however,  we  can  implement  Step  1  carefully  and  do  much  bet¬ 
ter. 

We  form  "buckets"  of  nodes  to  be  scanned  and  place  nodes  to  be 
scanned  together  in  the  same  bucket  if  they  have  similar  labels.  If 
the  width  of  the  buckets  is  q,  then  we  place  a  node  with  label  d, 
where  (k-l)'  <  d  <  kq  in  the  k^^  bucket.  We  then  simply  work  our 
way  through  the  buckets  in  ascending  order  of  bucket  number  scan¬ 
ning  the  nodes  in  each  bucket  in  arbitrary  order.  Thus,  there  is  no 
explicit  search  in  Step  1.  If  the  width  of  the  bucket  is  no  greater 
than  the  length  of  the  shortest  arc  in  the  graph,  then  any  node 
labeled  by  the  node  currently  being  scanned  will  reside  in  a  higher 
numbered  bucket  and  hence,  each  node  will  be  scanned  at  most  once. 

The  only  problem  with  this  procedure,  which  is  well  known,  is 
that  the  number  of  buckets  itself  may  become  very  large,  in  particu¬ 
lar  much  larger  than  the  number  of  nodes,  if  the  longest  arc  is  many 
times  greater  than  the  shortest.  In  this  case,  a  great  deal  of  storage 
and  running  time  may  be  wasted  in  dealing  with  empty  buckets.  We 
now  show  that  in  this  case  fewer  than  w*  buckets  are  required  in  all 


cases. 


We  begin  by  reindexing  the  arcs  out  of  each  node  corresponding 
to  the  M  ~  1  remaining  types  of  object  in  increasing  order  of  length. 
(Recall  that  the  arc  corresponding  to  the  best  type  of  object  was  re¬ 
moved  from  explicit  consideration.)  Thus,  Li,  is  the  length  of  the 
arc  which  corresponds  to  the  second  best  type  of  object,  etc.  The 
Wj  and  Pj  are  reindexed  correspondingly.  We  will  refer  to  arcs  of 
length  L.,  weight  w^,  and  profit  p^  after  this  reindexing  as  arcs  of 
type  i  (or  as  arcs  corresponding  to  objects  of  type  i).  For  sim¬ 
plicity,  we  will  assume  L.  Lj  for  i  j  although  the  procedure  does 
not  require  this. 

Consider  the  case  where  w*  is  relatively  prime  to  Wi.  By  the 
definition  above,  arcs  of  length  Li  are  the  shortest  arcs  in  the 
graph.  Since  Wi  and  w*  are  relatively  prime,  a  path  containing  at 
most  w*  -  1  arcs  comprised  entirely  of  arcs  of  type  1  exists  from  0  to 
each  other  node.  Thus,  we  know  that  no  node  need  have  a  label 
greater  than  Li(w*  -  1).  So,  w*  -  1  buckets  of  width  Li  suffice 
since  the  shortest  path  will  be  no  longer  than  the  aforementioned  one. 

In  the  case  where  w*  and  Wi  are  not  relatively  prime,  the  above 
path  will  loop  back  to  node  0  before  reaching  many  of  the  other 
nodes.  In  this  case,  the  following  preliminary  computation  is  carried 
out  before  setting  up  the  buckets: 

r  =  w* 

For  j  =  1,  M-1 

gj  =  r/GCD(r,w.) 
r  =  GCD(r,w.) 

where  GCD(i,j)  is  the  greatest  common  divisor  of  the  integers  i  and  j 
and  can  be  found  using  Euclid's  Algorithm,  whose  running  time  is 
bounded  by  the  following  Lemma: 
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Lemma  4:  Euclid's  Algorithm  has  a  worst  case  running  time  ot 
order  log  N  when  applied  to  find  the  GCD  of  N  and  M,  for  N  >  M. 

Proof :  For  N  >  M,  Euclid's  Algorithm  replaces  a  problem  on  N 

and  M  by  one  one  M  and  N  -  qM.  The  worst  case  for  this,  i.e.,  the 
one  which  converges  most  slowly,  is  when  q  =  1  at  all  stages.  This 

results  in  a  Fibonacci  sequence.  It  is  well  known  [1]  that  the 

Fibonacci  number,  Fj^  is  given  by: 


Thus,  the  runtime  of  the  algorithm  is  of  order  K  for  N  of  order 
Fj^,  i.e.,  K  is  of  order  log  N. 

We  observe  that,  corresponding  to  the  paths  comprised  entirely 
of  type  1  arcs  above,  there  is  now  a  tree  rooted  at  0,  spanning  all 
the  nodes,  and  containing  paths  with  at  most  gj  -  1  arcs  of  length 
Lj.  An  illustration  of  such  a  tree  is  given  in  Figure  4.  In  this  case 
only  nodes  0i8 ,  and  4  can  be  reached  using  only  arcs  of  type  1 
before  the  path  cycles  back  to  node  0.  The  number  of  nodes  so 
reachable  is  gi  (three  in  this  case).  The  original  set  of  w*  nodes  is 
partitioned  into  gj  parts  which  are  then  each  partitioned  into  ga 
smaller  parts  corresponding  to  the  nodes  which  are  now  reachable  via 
arcs  of  type  2.  The  partitioning  continues  until  all  nodes  are 
reached.  The  above  can  be  proven  rigorously  in  the  general  case 
using  the  properties  of  cyclic  groups. 

We  now  define  buckets  of  variable  width  corresponding  to  the 
longest  path  in  the  tree  defined  above.  Thus,  there  are  gi  -  1 


V# 
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buckets  of  width  li  followed  by  g2  -  1  buckets  of  width  I2,  etc.  A 
total  of  B  =  ^  (g.  -  1)  buckets  is  required.  Since  the  product  of  the 
g.'s  is  w*,  B  <  w*. 

We  have  shown  that  less  than  w*  buckets  are  required.  We  now 
show  that  no  node  is  scanned  more  than  once.  To  do  so,  we  show 
that  the  width  of  each  bucket  is  no  greater  than  the  length  of  the 
shortest  arc  still  permitted  in  a  path. 


Note  that  the  paths  from  0  to  j  correspond  to  ordered  selections 
of  objects  of  aggregate  weight  j;  i.e.,  there  are  distinct  paths  cor¬ 
responding  to  the  selection  of  the  same  objects  in  different  orders. 
We  can,  and  should,  consider  only  one  path  corresponding  to  each 
distinct  collection  of  objects.  This  is  easily  done  by  keeping  track  of 
the  last  type  of  arc  in  the  path  (which  we  are  doing  with  the  variable 
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PRj)  and  only  considering  arcs  with  the  same  or  higher  index  (object 
type).  Thus,  we  will  find  solutions  corresponding  to  a  selection  of 
objects  in  non-decreasing  order  of  type. 

Now,  consider  a  node  in  bucket  i  of  width  Wj.  This  node  was 
either  reached  via  a  path  in  the  tree  considered  above  or  via  an  arc 
whose  length  is  greater  than  that  of  any  in  that  tree.  In  latter  case, 
the  length  of  the  smallest  arc  under  consideration  exceeds  the  width 
of  any  bucket.  In  the  former  case,  since  we  are  considering  arcs  in 
increasing  order  of  length,  again  a  node  can  only  enter  a  bucket  via 
an  arc  at  least  as  long  as  the  width  of  the  bucket. 

Extensions 

If  W  <  w*\^  then  it  is  possible  for  w.  to  be  equal  to  w^  and  for 
objects  i  and  j  not  to  dominate  one  another.  In  particular,  if  w.  >  w. 
and  £.  <  ilj  neither  i  nor  j  may  dominate.  This  is  because  the  con¬ 
straint  on  W  may  prohibit  the  use  of  a  sufficient  number  of  objects 
with  the  smaller  length.  As  an  example  of  this,  consider  a  problem 
with  W  =  24  and  (W.,P.)  of  (5,50),  (6,58),  (11,109),  (1,0).  Here, 

the  Uj  are  0,2,1,  and  10,  and  W2  =  W3  modulo  5.  Neither  object  2 

nor  object  3  dominates  the  other,  however.  In  fact,  the  optimal 
solution  is  4  type-2  objects  in  this  case.  For  W  =  21  or  W  =  22, 
however,  type-3  objects  would  be  used  in  the  optimal  solution. 

For  W  =  23,  both  type-2  and  type-3  objects  would  be  used.  Lemma 

3  can  thus  not  be  extended  directly  to  the  case  whose  W  <  w*w. 

Thus,  it  is  not  possible  to  make  the  final  reduction  to  a  network 
with  w*  nodes  in  this  case.  However,  when  the  individual  w.  are 
small,  which  is  the  principal  case  of  interest  here,  w*w  will  be  small 
as  well.  The  above  algorithm  can  then  still  be  applied  effectively  to 
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the  network  with  W  nodes.  Alternatively,  one  can  view  the  following 
procedure  as  working  on  the  reduced  graph  with  w*  nodes  but  allow¬ 
ing  multiple  labels  on  each  node.  We  adopt  this  latter  point  of  view 
in  describing  the  modified  procedure  below. 

A  label  on  a  node,  i,  is  now  a  couple  (dj,  Cj)  where  d^  is  the 
amount  of  wasted  profit  to  get  to  node  i,  i  =  1,2,...  w*-l,  and  c.  is 
the  cycle  number  of  this  label.  The  c.  are  defined  by  the  relation 
i  =  i  -  w*c. 

where  j  is  the  node  in  the  network  Kj  (with  W  nodes)  which  would 
have  received  the  same  label.  A  node  i  in  K2  can  thus  have  several 
labels  corresponding  to  different  c.,  but  for  any  two  labels  (dj,c.) 
and  (dj',cp  on  the  same  node  i, 

d'  >  d.  c.'  <  c. 

1  i  1  1 

since  as  we  noted  above  the  only  reason  for  considering  a  larger  d. 
would  be  to  obtain  a  smaller  c..  Similarly,  for  any  pair  of  objects 
with  weights  having  the  same  residue  modulo  w*  the  object  with  the 
larger  weight  must  have  the  smaller  length. 

There  are  several  limitations  on  the  maximum  number  of  objects 

of  any  given  type.  In  particular,  m^,  the  maximum  number  objects  of 

type  i,  is  bounded  by 

I'w  1  w*  / 

vT  '  GCD(w*;rp’^| 

where  r^  is  the  residue  of  w.  modulo  W*.  The  first  term  on  the  right 
hand  side  follows  from  the  fact  that  any  larger  number  of  type  i 
objects  have  weight  greater  than  W.  The  second  term  follows  by  the 
some  reasoning  is  Lemma  2. 
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Proceeding  along  the  lines  similar  to  those  in  [3]  we  create 
(loga  Mp  +  1  objects  corresponding  to  1,2, 4, 8,...  and  2^  objects  of  ^ 

type  i,  where  b  is  largest  power  of  2  that  is  less  than  M..  We  thus 

now  have  defined  a  new  problem  with  at  most  M  loga  W  objects.  At 
most  one  object  of  each  type  is  permitted  in  the  solution.  I 

Proceeding  along  lines  similar  to  those  in  [3],  we  can  create 
(loga  Mp  +  1  objects  corresponding  to  1,2, 4, 8... 2  objects  of  type 
i,  where  b  is  the  largest  power  of  2  that  is  less  than  M..  We  then  I 

have  a  problem  with  at  most  =  Mloga(w*)  objects  where  at  most 

one  object  of  each  type  need  be  considered.  Thus,  we  could  create 
buckets  and  proceed  with  the  algorithm  as  above.  I 

One  can  refine  this  procedure  by  eliminating  dominated  objects 
from  consideration.  In  particular,  if  two  objects  i  and  j,  have  the 
same  residue  modulo  w*,  and  w.  >  w.  and  £.  <  Jt.  then  object  j  may  be  I 

eliminated  from  further  consideration.  One  must  be  careful,  however, 
not  to  allow  one  copy  of  an  object  to  eliminate  another  copy  of  the 
same  object.  * 

For  example  suppose  there  were  initially  4  types  of  object  with 
(Wi,Pi)  equal  to  (10,20),  (3,5),  (7,4)  and  (1,0)  W  =  10  and  W  <  w*, 
respectively.  If  W  =  37  we  must  consider  the  modified  procedure.  * 

We  thus  create  objects  with  (W.,pp  equal  to  (6,10),  (12,20),  and 
(6,10)  corresponding  to  2,4,  and  2  objects  of  type  2.  These  three 
new  objects  together  with  the  original  object  of  type  2  can  be  used  to  - 

select  anywhere  from  0  to  9  copies  of  the  type  2  object.  The  first 
new  object  is  identical  to  the  third  and  would  dominate  it  unless  we  ^ 

specifically  prohibited  this. 
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If  the  and  P.  are  drawn  independently  from  uniform  distribu¬ 
tions,  this  latter  refinement  will  reduce  the  average  number  of  objects 
to  no  more  than  w*£n(M)  objects  and  buckets  as  we  see  from  Lemma 
5. 

Lemma  5:  There  will  be  on  the  average  on  the  order  of  no  more  than: 


objects  left  undominated  after  dominated  objects  are  removed  in  the 
above  procedure. 

Proof:  As  observed  above,  there  are  approximately  Mlog2(W*)  or 

fewer  objects  before  dominance  is  checked.  Suppose  these  objects 
divide  evenly  into  residue  classes.  There  would  then  be  Mlog2(w*)/w* 
objects  in  each  residue  class. 

An  object  can  be  dominated  by  any  other  object  in  the  same 
class.  If  the  objects  are  ordered  in  increasing  order  of  £.,  an  object 
will  be  dominated  unless  its  Wj  is  less  than  the  w.  of  all  objects  pre¬ 
ceding  it  on  the  list  since  the  weights  are  uniformly  distributed,  the 
item  in  the  list  will  be  undominated  with  probability  Thus, 
given  a  list  of  length  L  the  expected  number  of  undominated  elements 
remaining  on  the  list  is 
L 

1  1 

j=l  1  £n(L) 

The  result  follows  directly  by  substituting  for  L.  If  the  residue 
classes  do  not  all  contain  the  same  number  of  elements,  the  result  is 
still  true  since  we  observe  that  the  quantity 
£n(Li)  +  £n(L2)  for  Li  +  L2  =  2L 


is  maximized  for  Li  =  L2  =  L,  that  is,  equal  sized  residue  classes 
yield  the  maximum  number  of  objects. 

Thus,  even  for  W  <  w*w,  we  have  a  number  of  buckets  polynomial 
in  w*,  not  W.  The  running  time  is  thus  at  worst  of  orderWw*  <  (w*)2w. 

It  is  possible  to  extend  the  procedure  still  further  to  take  into 
account  restrictions  on  the  number  of  permissible  objects  of  each 
type,  for  some  or  all  of  the  objects.  First,  if  one  is  given  an  a 
priori  restriction,  U.,  on  the  number  of  objects  of  type  i,  one  need 
only  set 

Mj  =  Min  (M.,U.) 

where  Mj  is  the  maximum  number  of  objects  of  type  is  permitted  and 
the  Mj  on  the  right  hand  side  is  computed  as  above.  If  the  a  priori 
maximum  permissible  number  of  best  objects  is  less  than  W/w*  than  a 
fictitious  best  object  with  w*  =  W  +  1  must  be  added  to  the  problem. 
This  maintains  the  validity  of  the  procedure  but  may  reduce  its 
efficiency  considerably  if  the  original  w*  was  much  smaller  than  W. 


CONCLUSIONS 

We  have  presented  an  algorithm  for  the  solution  of  the  simple 
(unbounded)  Knapsack  Problem  whose  running  time  and  storage  are 
functions  only  of  the  size  of  the  "best"  object  weight,  W*  in  the  case 
where  the  overall  weight  constraint  is  sufficiently  large.  In  the  cases 
where  the  overall  weight  constraint  is  smaller  or  where  restrictions 
exist  on  the  number  of  objects  of  a  given  type,  we  present  an  exten¬ 
sion  of  this  procedure  which  in  practice  is  often  very  efficient  and 
whose  worst  case  running  time  is  no  greater  than  (w*)2w  where  w  is 
the  weightiest  object. 
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ABSTRACT 

We  consider  the  simple  (unbounded)  Knap¬ 
sack  Problem  and  show  that  it  can  be  solved 
using  a  shortest  path  algorithm  requiring  both 
storage  and  running  time  which  are  polynomial 
functions  only  of  the  number  of  types  of  object 
and  the  size  (weight)  of  the  single  largest 
object.  This  is  a  significant  improvement  over 
previous  algorithms  for  the  solution  of  the 
Knapsack  Problem  using  shortest  paths,  which 
typically  require  storage  and  runtime  which  are 
functions  of  the  total  knapsack  size. 

INTRODUCTION 

The  Knapsack  Problem,  which  is  interest¬ 
ing  in  its  own  right,  has  also  received  much 
attention  recently  as  a  means  of  solving  integer 
programming  problems  via  a  relaxation  technique 
using  group  theory.  Formally,  the  Knapsack 
Problem  can  be  stated  as: 

M 

Maximize  z  «  I  p.x, 

1=1  *  * 

M 

subject  to  Z  w.X|  =  W 
1=1  *  ^ 

Xj  >  0  for  i=l,2,...M 
Xj  integer 

Thus,  we  are  given  M  types  of  object  with 
weight  Wj  and  profitability  Pj  per  object  of  type 

i.  We  are  required  to  maximize  the  total  profit 
subject  to  an  constraint  that  the  sum  of  the 
weights  of  the  objects  selected  equals  W.  In 
the  simple  (unbounded)  version  of  the  problem 
which  we  consider  here,  there  is  no  restriction 
on  the  number  of  objects  of  each  type  which 
may  be  included  in  the  solution. 

Sometimes  an  inec^ality  constraint  is  used 
in  place  of  the  equality  constraint  considered 
here;  i.e.,  one  seeks  objects  whose  aggregate 
weight  does  not  exceed  V.'.  Such  a  problem  can 
be  transformed  into  a  problem  with  an  inequal¬ 
ity  constraint  by  including  an  additional  object 
with  unit  weight  and  zero  profitability.  Here 
we  will  concentrate  primarily  on  problems  with 
equality  constraints. 

We  assume  that  W  and  the  w.  are  non- 
negative  integers  (or,  more  generally*,  that  they 

*  This  work  was  supported  by  NSr  Grant 
ENG7908120. 
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are  commensurate)  and  that  the  p,  are  non- 
negative  . 

Previous  approaches  to  the  solution  of  this 
problem  include  those  reported  by  Horowitz  and 
Sahni  [1]  using  dynamic  programming  and  impli¬ 
cit  enumeration.  These  approaches  have  been 
found  to  be  effective  for  a  wide  range  of  prob¬ 
lems  but  have  exhibited  excessive  runtimes  for 
problems  with  a  large  number  of  nearly  equally 
valuable  (in  terms  of  cost  per  unit  weight) 
objects.  Also,  their  worst  case  running  times 
are  exponential  in  the  number  of  object  types. 
Denardo  and  Fox  [2]  have  developed  an  ap¬ 
proach  which  overcomes  these  objections  by 
solving  the  problem  as  a  shortest  path  problem 
in  a  network  with  W  nodes  and  Wxm  arcs. 
Thus,  their  approach  requires  storage  propor¬ 
tional  to  W  and  runtime  proportional  to  WxM.  If 
W  and  M  are  sufficiently  small,  their  approach  is 
extremely  attractive.  If  W  is  large,  however, 
the  storage,  and  to  a  lesser  extent  the  runtime, 
become  prohibitive. 

Denardo  and  Fox  present  their  algorithm  in 
the  context  of  using  a  Knapsack  Problem  as  'an 
integer  programming  relaxation.  In  such  cases, 
W  is  generally  of  the  same  order  as  the  w^.  We 

extend  their  approach  to  the  case  where  W  is 
very  large  but  the  individual  Wj  are  not,  and 

exploit  the  underlying  cyclic  group  structure  of 
the  problem  to  develop  an  algorithm  with  both 
runtime  and  storage  requirements  a  function  only 
of  the  weight  of  the  largest  single  object  and  the 
number  of  object  types. 

BASIC  PROCEDURE 

For  the  sake  of  clarity,  we  begin  by  des¬ 
cribing  the  basic  procedure  informally.  Exten¬ 
sions,  refinements  and  a  more  formal  description 
are  given  in  later  sections. 

Given  a  particular  Knapsack  Problem  (i.e., 
given  values  for  W,  M  and  the  pairs  (Pj.wp, 

i=l _ M)  we  can  construct  a  graph,  G,  with 

nodes  j=0,l,...W  and  directed  arcs  e=(i,m) 
where  m=itWj^  for  some  k.  There  is  an  arc  in  G 

corresponding  to  each  i<W  and  each  k<M  pro¬ 
vided  m<W.  Figure  1  shows  such  a  graph  for 
M=3,  W=7,  and  the  (pj,  Wj)  =  (3,2),  (5,3), 

(10,5).  The  length  of  each  arc  (shown  next  to 
Uie  arc)  is  the  profit  obtained  by  selecting  the 
type  of  object  corresponding  to  the  arc.  Node 
numbers  correspond  to  weight  used  up  by  the 
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ob|Pcts  already  selected,  a  path  from  node  0  to 
node  j  corresponds  to  a  s(. lection  ol  obiects  with 
aggregate  weight  j  and  the  length  of  the  path 
(the  sum  of  the  lengths  of  the  arcs  in  the  path) 
corresponds  to  the  profit  obtained.  There 
exists  a  path  corresponding  to  any  feasible 
selection  of  objects  and  hence,  the  longest  path 
from  0  to  W  in  G  corresponds  to  the  optimal 
solution  to  the  Knapsack  Problem.  This  is  the 
essence  of  the  shortest  path  technique  de¬ 
scribed  by  Denardo  and  Fox  among  others. 

It  is  possible  to  accomplish  the  same  opti¬ 
mization,  however,  using  a  graph  containing  a 
much  smaller  number  of  nodes.  Johnson  |3j 
observed  that  it  is  not  in  general  necessary  to 
label  all  nodes  in  order  to  obtain  the  optimim 
solution,  but  rather,  that  one  may  stop  when 
node  j=w*w  has  been  reached,  where  w=  max 

(w.)  and  p*/w*=  max  (Pj/w,)  ;  i.e.,  vJ  is  tAe 
J  j  J  J 

weight  of  the  "heaviest"  object  and  w*  is  the 
weight  of  the  "best"  object.  We  will  explore 
this  observation  and  exploit  it  in  developing  a 
new,  more  efficient  algorithm. 

The  key  to  proving  the  validity  of  the 
above  observation  is  that  if  an  arbitrary  solu¬ 
tion  contains  one  or  more  objects  whose  aggre¬ 
gate  weight  is  a  multiple,  say  q,  of  w*,  then  a 
new  solution  no  worse  than  the  first  can  be 
obtained  by  replacing  these  objects  by  q  best 
objects.  Such  a  replacement  leaves  the  total 
weight  unaltered  and  cannot  decrease  the  total 
profit.  Thus,  we  need  only  consider  partial 
solutions  which  do  not  contain  any  objects 
whose  aggregate  weight  is  a  multiple  of  w*. 
The  appropriate  number  of  best  objects  can  be 
added  to  complete  any  such  solution  at  the  end 
of  the  procedure. 

There  can  be  at  most  w*-l  objects  in  any 
such  partial  solution,  lo  see  this,  consider  a 
partial  solution  containing  b  objects  with  weights 

w(l),  w(2),  ...  w(b).  form  the  sums  Si=(2w(i)) 

>  i=l 

modulo  w*.  The  s.  must  all  be  different  and 

must  all  be  integers  in  the  range  1  to  w*-l. 
Otherwise,  the  b  objects  contain  a  subset  of 
aggregate  weight  qw*  since  : 


and 


s.  =  0  Jw(i)  =  qw* 
*  1»1 


k 

s ,  =  s,,  ■*  2w(i)  =  qw*  for  j  <  k 
J  “  l=j+l 


Note  that  It  is  in  fact  possible  for  such  a 
partial  solution  to  contain  w*-l  objects,  for 
instance  w*-l  objects  iurtth  weight  Wj  when  w. 

and  w*  are  relatively  prime  (i.e.,  contain  no 
common  factors  greater  than  1),  so  this  bound 


on  I  hr  maximum  number  of  objects  is  tight  at 
least  in  some  cases. 

Since  w  is  the  weight  of  the  largest  ob¬ 
ject,  Johnson's  observation  is  proven.  Thus, 
one  could  replace  G  by  another  graph  with  a 
similar  structure  but  with  only  w(w*-l)tl  nodes 
and  thus  improve  both  the  memory  and  runtime 
performance  of  the  procedure  if  the  new  number 
of  nodes  is  less  than  W. 

Using  similar  reasoning,  we  go  substantially 
further.  First,  we  redefine  the  lengths  of  the 
arcs  in  G  above  to  represent  the  amount  of 
wasted  profit  relative  to  a  (not  necessarily 
realizable)  path  comprised  entirely  of  best  arcs. 
Note  that  this  redefinition  of  arc  lengths  re¬ 
verses  the  previous  ordering  of  the  lengths  of 
different  paths  between  nodes  0  and  any  other  j. 
The  optima!  solution  is  thus  now  the  shortest 
path  from  0  to  W  in  the  new  G,  with  arc  lengths 
given  by  Wj^  {p*/w*  -  pj^)  for  arcs  which  cor¬ 
respond  to  type  k  objects.  Note  that  all  arcs 
have  non-negative  length  and  best  arcs  are  the 
only  ones  that  have  zero  length.  For  simplicity, 
we  will  assume  that  the  best  arc  type  is  unique; 
the  case  of  multiple  best  arc  types  is  in  fact 
easy  to  handle  in  the  procedure. 

We  now  note  that  every  pair  of  nodes  j  and 
j+w*  in  G  are  connected  by  a  zero  length  arc 
and  hence  that  if  a  path  of  length  L  from  0  to  j 
exists  then  a  path  of  length  L  from  0  to  j^w* 
exists  too.  For  W  sufficiently  large,  in  parti¬ 
cular,  for  W>w*w,  it  is  sufficient  to  consider  a 
new  graph,  H,  with  only  w*  nodes  corresponding 
to  the  original  node  numbers  modulo  w*.  This 
gives  rise  to  a  graph  of  the  type  shown  in 
Figure  2  for  the  same  example  giver,  in  Figure  1 
except  that  W  is  now  assumed  to  be  much  larger. 
(This  graph  is  in  fact  a  Cayley  color  graph  for 
the  cyclic  group  of  order  w*.) 


Figure  1.  A  graph  for  Figure  2.  Cyclic  graph 
solving  the  for  Knapsack 

Knapsack  Problem 

Problem 
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Note  that  this  graph  contains  arcs  both 
from  higher  numbered  nMes  to  lower  numbered 
nodes  and  vice  versa,  unlike  G  which  contained 
only  arcs  from  lower  n\imbered  nodes  to  higher 
numbered  nodes.  Also,  note  that  arcs  appear  as 
zero  length  self  loops  and  can  thus  be  ignored. 
Since  aU  remaining  arcs  are  of  positive  length, 
aU  cycles  are  of  positive  length  and  the  shortest 
path  problem  is  well  defined.  Thus,  we  have 
almost  reduced  the  problem  to  one  of  finding  a 
shortest  path  in  a  graph  with  w*  nodes  instead 
of  W  nodes,  the  resultant  path  augmented  by  the 
appropriate  number  of  bMt  arcs  to  form  the 
optimal  solution. 

The  only  loose  end  to  tie  up  is  what  to  do 
in  the  case  where  W  <  tir*W.  In  this  case,  it  is 
possible  that  the  shortest  path  found  in  H  above 
may  correspond  to  a  path  to  a  node  greater  than 
W;  l.e.,  the  "appropriate"  number  <2  best  arcs 
is  negative.  Put  another  way,  the  reduction 
from  G  to  H  was  based  on  the  assumption  that 
node  J  is  dominated  by  node  ]-h«*  in  G  since  the 
lower  numbered  node  could  almys  extend  Its 
paths  through  the  higher  numbered  node  via  a 
zero  length  arc.  This  is  not  true,  however,  if 
)«w*  >  W-W  since  a  path  to  W  may  exist  from  the 
lower  numbered  node  but  not  from  the  higher 
numbered  node. 

We  can  handle  this  problem  by  adding  an 
additional  W  nodes  to  H,  corresponding  to  the 
nodes  ...  w  in  G  and  adding  arcs  from 

appropriate  nodes  0,  ...  w*-l  to  these  new 
nodes  corresponding  to  each  type  of  object. 
Finally,  we  add  arcs  between  lower  numbered 
nodes  and  higher  numbered  nodes  correspond* 
ing  to  each  type  of  object,  exactly  as  in  G. 
This  entire  transformation,  which  at  first  ap¬ 
pear  complex,  amounts  simply  to  collapsing  G 
by  removing  the  nodes  in  the  range  w*  through 
W-W  since  the  removed  nodes  are  not  essential 
to  finding  the  optimal  solution. 

Figure  3  shows  part  of  such  a  graph  in  a 
case  where  vr*«S,  W^,  and  Ws22.  Shown  are 
only  the  arcs  corresponding  to  an  object  with 
weight  2.  Thus,  even  with  this  complication, 
both  the  runtime  and  storage  have  been  re¬ 
duced  to  a  polynomial  in  M,  w*,  and  W;  i.e., 
there  Is  no  functional  dependence  on  W.  Note 
that  if  W  <  w*tW  then  we  simply  solve  the 
problem  using  G;  no  reduction  is  necessary. 

In  the  sequel,  we  arlU  limit  the  discussion 
to  the  ease  where  W  >  The  ease  where  W 

<  w*W  foUows  by  the  above  transformation, 
adding  storage  and  runtime  proportional  to  W. 
The  final  phase  of  the  procedure  in  this  case  is 
an  ordinary  shortest  path  aloorithm  where 
nodes  w*  timugh  are  eau  scanned  se¬ 
quentially. 

ALGORITHM  DESCRIPTION 

We  now  describe  the  actual  procedure  in 
more  deti^  and  analyze  its  storage  and  runtime. 


Figure  3.  A  graph  for  solving,  the  Knapsack 
Problem  when  W  <  w*  w 


The  foUoedng  is  the  essence  of  the  procedure: 
Step  0:  (Initialization) 

’dg  «  0 

dj  «  •  i=l,...  w«-l 

where  dj  is  the  current  estimate  of 
length  of  the  shortest  path  from  0‘to 


Step  1:  Find  i,  the  best  node  to  scan  next. 

This  is  discussed  in  detail  below. 

Step  2:  (Scan  node  i) 

For  kB2,  ...  M 

Let  j®(l+w^)  modulo  w* 

If  dj>dj  ♦  L|^  then  dj<Hl|  *  L^  and 
PRj«k 

where  L^  is  the  length  of  an  arch 

corresponding  to  a  type  k  object 
(Lk*Wk(p*/w*  -  pjj)  as  defined  above) 
and  PRj  records  the  type  of  the  arc 

used  to  label  node  j  and  hence  keep 
track  of  the  optimal  raution. 

Step  3:  Return  to  step  1  if  any  nodes  remain 
to  be  scanned;  otherwise  stop. 

It  is  clear  that  the  storage  required  fqr 
this  procedure  is  linear  in  w*^M.  If  Step  1  is 
sufficiently  simple  and  each  node  is  only  scanned 
once,  the  procedure's  runtime  will  be  dominated 
by  Step  2  and  will  be  proportional  to  Mw*.  This 
is  in  fact  the  case,  as  we  will  see. 


A  node,  j,  must  be  scanned  if  its  label, 
dj,  IS  improved  (reduced)  as  It  may  then  im¬ 
prove  the  labels  of  other  nodes.  If  we  arc  to 
ensure  that  each  node  is  scanned  at  most  once, 
we  must  defer  scanning  it  until  its  label  cannot 
be  improved.  When  all  arc  lengths  are  posi¬ 
tive,  as  they  are  here,  this  objective  is  achiev¬ 
ed  by  scanning  the  nodes  in  ascending  order  of 
d..  This  IS  Dijkstra's  well  known  shortest  path 

algorithm. 

In  the  general  case,  Dijkstra's  algorithm 
requires  that  in  Step  1  we  find  the  as  yet  un¬ 
scanned  node  with  the  smallest  label.  If  this 
were  done  naively,  it  would  require  an  examina¬ 
tion  of  w*  labels  and  the  overall  procedure 
would  require  (w*>*  ♦  Mw*  operations.  In  this 
case,  however,  we  can  implement  Step  1  care¬ 
fully  and  do  much  better. 

We  form  "buckets"  of  nodes  to  be  scanned 
and  place  nodes  to  be  scanned  together  in  the 
same  bucket  if  they  have  similar  labels.  If  the 
width  of  the  buckets  is  q,  then  we  place  a  node 

with  label  d,  where  (k-1)  <  d  <  kq  in  the  k^ 
bucket.  We  then  simply  work  our  way  through 
the  buckets  in  ascending  order  of  bucket  num¬ 
ber  scanning  the  nodes  in  each  bucket  in 
arbitrary  order.  Thus,  there  is  no  explicit 
search  in  Step  1.  If  the  width  of  the  bucket  is 
no  greater  than  the  length  of  the  shortest  arc 
in  the  graph ,  then  any  node  labeled  by  the 
node  cunently  being  scanned  will  reside  in  a 
higher  numbered  bucket  and  hence,  each  node 
will  be  scanned  at  most  once. 

The  only  problem  with  this  procedure, 
which  is  well  known,  is  that  the  number  of 
buckets  itself  may  become  very  large,  in  par¬ 
ticular  much  larger  than  the  number  of  nodes, 
if  the  longest  arc  is  many  times  greater  than 
the  shortest.  In  this  case,  a  great  deal  of 
storage  and  running  time  may  be  wasted  in 
dealing  with  empty  buckets.  We  now  show  that 
in  this  case  fewer  than  w*  buckets  are  required 
in  all  cases. 

We  begin  by  reindexing  the  arcs  out  of 
each  node  corresponding  to  the  M-1  remaining 
types  of  object  in  increasing  order  of  length, 
(hecall  that  the  arc  corresponding  to  the  best 
type  of  arc  was  removed  from  explicit  constd- 
eralion.i  Thus,  L^,  is  the  lenth  of  the  arc 

whici.  corresponds  to  the  second  best  type  of 
object,  etc.  The  wv  and  p^  are  reindexed 

correspondingly.  Wc  will  refer  to  arcs  of 
length  L|,  weight  w^,  and  p.'-ofit  p^  after  this 

reindexing  as  arcs  of  type  i  (or  as  arcs  cor¬ 
responding  to  objects  of  type  i).  Tor  sim¬ 
plicity,  we  will  assume  b|  ¥  L|  for  i^j  although 

the  procedure  does  not  require  this. 

Consider  the  case  where  w*  is  relatively 
prime  to  w^ .  by  the  definition  above,  arcs  of 


length  1,.  are  the  shortest  arcs  in  the  graph. 
Since  wl  relatively  prime,  a  path 

containing  at  most  w*-]  arcs  comprised  entirely 
of  arcs  of  type  1  exists  from  0  to  each  other 
nude.  Thus,  we  know  that  no  node  need  have  a 
label  greater  than  Lj(w*-1)  So,  w*-l  buckets 

of  width  Lj  suffice  since  the  shortest  path  will 

be  no  longer  than  the  aforementioned  one. 

In  the  case  where  w*  and  w.  are  not  rela¬ 
tively  prime,  the  above  path  wilt  loop  back  to 
node  0  oefore  reaching  many  of  the  other  nodes. 
In  this  case,  the  following  preliminary  computa¬ 
tion  is  carried  out  before  setting  up  the  buck¬ 
ets: 

r=w* 

For  j=l.  M-1 

gj=  r/GCD(r.w.) 
r=GCD(r,Wj) 

where  GCD(i,j)  is  the  greatest  common  divisor  of 
the  integers  i  and  j  and  can  be  found  using 
Euclid's  Algorithm. 

We  observe  that,  corresponding  to  the 
paths  comprised  entirely  of  type  1  arcs  above, 
there  is  now  a  tree  rooted  at  0,  spanning  all  the 
nodes,  and  containing  paths  with  at  most  g.-l 

arcs  of  length  L|.  An  illustration  of  such  a  tree 

is  given  in  Figure  4.  In  this  case  only  nodes 
0,8,  and  4  can  be  reached  using  only  arcs  of 
type  1  before  the  path  cycles  back  to  node  0. 
The  number  of  nodes  so  reachable  is  g^  (three 

in  this  case).  The  original  set  of  w*  .nodes  is 
partitioned  into  gj  pans  which  are  then  each 

partitioned  into  g^  smaller  parts  corresponding 

to  the  nodes  which  are  now  reachable  via  arcs  of 
type  2.  The  partitioning  continues  until  all  nodes 
are  reached.  The  above  can  be  proven  rigor¬ 
ously  in  the  general  case  using  the  properties  of 
cyclic  groups. 

We  now  define  buckets  of  variable  width 
corresponding  to  the  longest  path  in  the  tree 
defined  above  Thus,  there  are  gj-1  buckets  of 

width  Lj  followed  by  g^-1  buckets  of  width  L2, 
etc.  A  total  of  B=  I  fgj-l)  buckets  is  required. 

Since  the  product  of  the  g.'s  is  w»,  B  <  w*. 

We  have  shown  that  less  than  w*  buckets 
are  required.  We  now  show  that  no  node  is 
scanned  more  than  once.  To  do  so,  we  show 
that  the  width  of  each  bucket  is  no  greater 
than  tne  length  of  the  shortest  arc  still  permit¬ 
ted  in  a  path. 


Figure  4:  A  spanning  tree  bounding  the 
lengths  of  shortest  paths 

Note  that  the  paths  from  0  to  j  correspond 
to  ordered  selections  of  objects  of  aggregate 
weight  j;  i.e.,  there  are  distinct  paths  cor¬ 
responding  to  the  selection  of  the  same  objects 
in  a  different  order-  We  can,  and  should,  con¬ 
sider  only  one  path  corresponding  to  each  dis¬ 
tinct  collection  of  objects.  This  is  easily  done 
by  keeping  track  of  the  last  type  of  arc  x  in 
the  path  (which  we  are  doing  with  the  variable 
PR.)  and  only  considering  arcs  with  the  same  or 
hi^er  index  (object  type).  Thus,  we  %rill  find 
solutions  corresponding  to  a  selection  of  objects 
in  non-decreasing  order  of  type. 

Now,  consider  a  node  in  bucket  i  of  width 
w..  This  node  was  either  reached  via  a  path  in 
the  tree  considered  above  or  via  an  arc  whose 
length  is  greater  than  that  of  any  in  that  tree. 
In  latter  case,  the  length  of  the  smallest  arc 
under  consideration  exceeds  the  width  of  any 
bucket,  in  the  former  case,  since  we  are  con¬ 
sidering  arcs  in  increasing  order  of  length, 
again  a  node  can  only  enter  a  bucket  via  an  arc 
at  least  as  long  as  the  width  of  the  bucket. 

CONCLUSIONS  AND  REFINEMENTS 

The  basic  procedure  as  stated  is  seen  to 
have  a  storage  requirement  proportional  to 
w*»w^M  in  the  worst  case  and  runtime  propor¬ 
tional  to  (w*^w)M  in  the  worst  case.  This  is  a 
significant  improvement  over  procedures  of  this 
type  currently  in  use.  Several  refinements  can 
be  made  to  the  basic  procedure,  however,  to 
improve  its  performance  still  further  in  specific 
cases. 

If  WjSWj  and  Pj<Pj,  object  1  can  be  eliminat¬ 
ed  from  consideration.  In  fact,  if  wi 
congruent  to  each  other  modulo  w*  and 

object  i  can  be  eliminated  from  consideration  In 
paths  between  nodes  i  and  j  in  the  range  0  to 
w*-l  (but  not  other  nodes  in  the  case  W<w*W.) 


Note  that  at  mo.sl  w*-l  types  of  arcs  can  thus 
remain  betwien  the  nodes  0  and  vc*-l  and  both 
the  storage  and  running  time  become  functions 
only  of  w*,  not  w  or  M.  (In  the  case  of  in¬ 
equality  constraints,  this  dominance  can  be 
pushed  still  farther  and  object  i  can  be  eliminat¬ 
ed  if  w.  2  Wj  and  p^  <  Pj.) 

Since  the  shortest  path  to  any  node  is 
bounded  by  v=  (gj'DLj,  when  scanning  a  node 

with  label  b,  arcs  of  length  greater  than  v-b 
can  be  eliminated  from  consideration.  Since  arcs 
are  scanned  in  ascending  order  of  length,  this  is 
easily  implemented  by  terminating  the  scan  when 
the  first  such  arc  is  encountered. 

We  are  currently  carrying  out  computational 
experiments  to  compare  the  efficiency  of  this 
procedure  with  others  currently  in  use. 
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Abstract 

Alternating  chain  procedures  can  be  thought  of  as  generalizations  of  the  greedy 
algorithm  in  that  instead  of  accepting  the  best  remaining  element,  they  seek  to 
obtain  a  better  augmentation  by  examining  a  wider  range  of  alternatives.  It  is 
possible  to  generalize  the  notion  of  an  augmenting  sequence  to  include  augmentations 
which  are  in  effect  trees  as  opposed  to  simply  paths  such  that  these  augmentations 
are  sufficient  to  guarantee  optimality.  Unfortunately,  in  the  worst  case,  these 
trees  are  of  exponential  size.  We  examine  the  application  of  such  generalized 
augmenting  sequences  to  the  solution  of  NP-complete  problems  and  examine  their 
effectiveness  and  efficiency. 

Introduction 

The  theory  of  NP-completeness,  which  was  first  expounded  by  Cook  [1],  has  led 
to  a  search  for  a  unified  treatment  of  combinatorial  optimization  problems.  Cook 
was  able  to  characterize  a  very  large  class  of  interesting  and  important  problems 
as  being  equivalent  in  the  sense  that  an  efficient  algorithm  capable  of  finding  an 
optimal  solution  to  any  one  of  these  problems  can  be  used  to  obtain  optimal  5olu> 
tions  to  all  of  the  others.  Many  papers  by  many  authors  and  an  excellent  com¬ 
pendium  (2]  of  problems  in  this  class  (as  well  as  techniques  for  proving  that  a 
problem  is  in  this  class)  have  been  published  since  Cook's  seminal  paper.  Prob¬ 
lems  in  this  class  are  called  NP-complete  problems  (or,  more  properly,  NP-hard  when 
they  are  optimization  problems  as  opposed  to  decision  problems). 

Cook's  results  can  be  interpreted  in  several  ways.  One  of  these  is  to  say  that 
many  clever  people  have  spent  many  years  trying  and  failing  to  find  efficient 
algorithms  for  individual  problems  in  this  class.  Surely  one  of  them  would  have 
succeeded  if,  in  fact,  such  algorithms  existed.  Hence,  it  is  unlikely  that  such  an 
algorithm  will  be  found  and  it  is  tempting  to  stop  looking  (or  one.  This  leads  to 
the  development  of  heuristics  for  the  solution  of  such  problems  [3]  and  to  proba¬ 
bilistic  methods  [4]. 

An  alternate  interpretation  is  that  this  pessimistic  view  is  justified  only  with  re¬ 
spect  to  algorithms  which  guarantee  optimal  solutions  and  reasonable  runtimes  for 
all  instances  (input  data  sets)  of  a  problem.  In  this  paper  we  speak  of  an  algo¬ 
rithm's  runtime  being  reasonable  if  it  grows  polynomially  rather  than  exponentially 
with  the  size  of  the  problem.  Thi.s  does  not  preclude  the  existence  of  algorithms 
with  guaranteed  reasonable  runtimes  and  which  yield  optimal  (or  near-optimal) 
solutions  with  high  probability.  Nor  does  it  preclude  the  existence  of  algorithms 
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which  guarantee  optimal  solutions  and  which  have  reasonable  runtimes  with  high 
probability.  There  are  many  examples  of  both  types  of  algorithms  which  are  used 
in  practice  to  solve  specific  NP-complete  problems.  Most  important,  the  theory  of 
NP-compIeteness  does  not  preclude  or  even  lessen  the  likelihood  of  the  existence  of 
algorithms  which  solve  specific  (nontrivial)  instances  of  a  problem  and  guarantee 
both  an  optimal  solution  and  reasonable  runtime. 

In  this  paper,  we  explore  this  second,  more  optimistic,  point  of  view  and  present 
a  family  of  algorithms  for  the  solution  of  an  NP-complete  problem.  Some  algorithms 
in  this  family  have  guaranteed  reasonable  runtimes.  Others  guarantee  optimal 
solutions.  While  the  algorithms  are  presented  for  the  solution  of  a  specific  prob¬ 
lem,  the  technique  can  be  extended  to  the  solution  of  other  problems  as  well. 

Matroid  Theory 

A  specific  way  of  approaching  the  solution  of  many  combinatorial  optimization 
problems  is  via  matroid  theory.  The  excellent  book  by  Lawler  (5]  gives  a  complete 
treatment  of  this.  Here  we  outline  the  fundamentals  of  this  theory  which  are 
necessary  for  the  presentation  which  follows. 

A  matroid  is  a  couple  (C,F)  where  E  is  a  finite  set  of  m  elements; 

E  »  (ej  I  i  =  1,2,  ...  M} 

and  F  Is  a  family  of  independent  subsets  of  E.  The  notion  of  independence  is 
quite  general.  We  require,  however,  that  it  satisfy  two  properties: 

PI:  Every  subset  of  an  independent  set  is  independent,  i.e.,  if 

I  c  F  and  I  C  I  J  ^  F 

P2:  If  Ip  and  are  independent  subsets  of  E  containing  P  and  P  +  1 

elements,  respectively,  then  there  exists  an  element,  e  t  Ip^j  (e  t  Ip) 
such  that  Ip  ^(e]  is  an  independent  set  containing  P  1  elements. 

Given  two  matroids.  (E,F,)  and  (E,r2),  defined  on  the  same  set  of  elements,  but 
using  two  different  notions  of  independence,  we  define  an  intersection  of  them  to 
be  any  subset,  1(2  E,  such  that  I  t  Fi,  and  I  c  F*.  This  definition  can  be  ex¬ 
tended  to  cover  three  or  more  matroids  as  well. 

Many  combinatorial  optimization  problems  can  be  thought  of  as  finding  the  be.st 
independent  set  in  a  matroid  or  the  best  intersection  of  two  or  more  matroids.  If 
weights,  w.,  are  associated  with  the  elements,  e^,  in  E,  then  one  can  speak  of  the 
best  set  as  being  the  one  with  largest  total  weight.  The  maximal  (or  minimal) 
spanning  tree  problem  can  be  thought  of  as  finding  the  maximum  (or  minimum) 
weight  independent  set  in  a  matroid  (E.F)  where  E  is  the  set  of  edges  in  the 
graph  and  F  is  the  family  of  forests.  A  forest  is  defined  to  be  a  set  ot  0  or  more 
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edges  which  do  not  contain  a  circuit.  As  another  example,  I.awler  [5,  p  304) 
shows  that  the  Traveling  Salesman  Problem  can  be  thought  of  as  finding  the  best 
intersection  of  three  matroids.  The  problem  of  finding  the  maximum  weight  inter¬ 
section  of  three  matroids  has  been  shown  to  be  NP-complete  |2].  Lawler  shows  [5, 
p.  364]  that  the  problem  of  finding  intersections  of  four  or  more  matroids  can  be 
reduced  to  that  of  finding  intersections  of  three.  There  are  many  other  combina¬ 
torial  optimization  problems  which  can  be  naturally  thought  of  as  matroid  intersec¬ 
tion  problems.  The  theory  of  NP-completeness  assures  us  that  all  problems  can  be 
thought  if  in  this  way. 

We  will  consider  one  of  the  simplest  possible  3-Matroid  Intersection  Problems  in  the 
sequel  for  the  sake  of  clarity.  The  problem  considered  is  the  Three  Dimensional 
Assignment  Problem  (TDAP).  In  this  problem,  we  are  given  N  people,  N  jobs, 
and  N  days.  There  is  a  cost,  of  having  person  i  doing  job  j  on  day  k.  Each 
person  is  to  do  only  one  job,  each  job  is  to  be  done  only  once,  and  only  one  job 
is  to  be  done  on  a  day.  Formally  the  problem  is: 


Minimize  2=1  C... 

j  j  k  >JK  iJk 


such  that 


J  V  '‘ijk  '  1  •••  N 


Xjjij  e  {0,1} 


Thus,  setting  to  1  corresponds  to  having  person  i  do  job  j  on  day  k.  This 
problem  can  be  viewed  as  an  intersection  of  three  partition  matroids.  Given  a  set 
of  elements,  E  (in  this  case,  the  a  partition  matroid  can  be  defined  by  a 

partition  of  E  and  a  vector.  A,  constraining  the  number  of  elements  of  E  which 
may  be  selected  from  any  part  of  the  partition.  Formally,  we  have  the  partition  of 
E  into  subsets  Ej,  j  =  1,  ...  k,  where 

U  Ej  =  E  and  Ej  Ej  =  ♦  for  1  j 

j 

and  an  integer  vector  A  =  (a^l  j  =  1.  ...  k| 

A  matroid  (E.F)  is  then  defined  where  F  consists  of  all  subsets,  I,  of  E  formed  by 
selecting  no  more  than  a^  elements  of  E^. 

In  the  case  of  the  TDAP,  the  first  partition  of  the  X-j^  is  by  person,  i.e., 

Ei  =  {Xjj^lj  =  1,  ...  N;  k  =  1.  ...  N) 

and  a^  =  1  for  all  i.  The  independent  sets  in  this  first  matroid  correspond  to 
assigning  each  person  at  most  one  job.  Similarly,  two  more  matroids  can  be  de¬ 
fined  to  constrain  jobs  and  days.  Intersections  of  these  three  matroids  correspond 
to  feasible  partial  assignments  and  intersections  of  maximum  cardinality  correspond 
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to  feasible  complete  assignments.  If  we  define  weights  W..|^  associated  with  the 
'"lik  =  C  -  CijK 

where  C  is  larger  than  any  then  the  maximum  weight  intersection  corre¬ 

sponds  to  the  optimal  solution  to  the  TDAP. 

Augmenting  Paths 

We  now  define  a  family  of  algorithms  for  the  solution  of  matroid  intersection  prob¬ 
lems.  These  are  generalizations  of  the  basic  procedure  given  in  [6]. 

Given  a  matroid  (E,F)  (and  hence  a  notion  of  independence)  and  a  (not  necessarily 
independent)  subset  S.  of  E,  we  define  the  span  of  S,  denoted  sp(S),  as  S  to¬ 
gether  with  ail  elements  of  E  not  independent  of  the  elements  in  S,  that  is 

sp(S)  >  (  e  I  I  v^{e}  ^  F  where  1  is  any  independent  subset  of  S) 

If  S  is  an  independent  set  and  e  e  sp(S)  then  e  forms  a  unigue  cycle,  which  we 
denote  by  C(e),  *rith  S.  A  cycle  is  a  dependent  set  which  becomes  independent  if 
amy  element  is  removed  from  it. 

If  the  matroid  intersection  problem  only  involves  two  matroids,  we  can  obtain 

a  maximum  weight  intersection  by  producing  a  sequence  of  intersections, 
containing  K  elements,  for  K  =  1,2,  ...  m.  Each  is  the  maximum  weight 

intersection  containing  K  elements.  The  algorithm  which  produces  the  is 

called  an  augmenting  path  procedure  because  it  augments  to  produce 
by  finding  the  longest  path  in  the  graph  defined  below. 

We  define  G'  '  to  be  a  bipartite  graph  with  nodes  corresponding  to  the  elements, 
ej,  of  E  plus  distinguished  start  and  finish  nodes,  a  and  z.  Directed  arcs  are 
defined  as  follows: 


(a,i) 

i  c  E  -  sp,(I^*^^) 

(i.j) 

i  e  E  - 

j  e 

(l.z) 

i  e  E  -  spid^*^^) 

(j,l) 

1  e  E  - 

i  e  C^^)(i) 

Paths  from  a  to  z  correspond  to  augmentations  of  1^*^^  that  is,  to  sets  of  element.^ 
to  be  added  or  deleted  from  to  produce  an  intersection  with  K  «  I  elements. 
Notice  that  all  a  to  z  paths  go  alternately  through  nodes  not  contained  in 
(which  are  to  be  added  to  and  nodes  in  1^*^^  (which  are  to  be  deleted). 

Note  also  that  there  is  one  more  node  of  the  former  type  than  there  is  of  the 
latter  and  hence  an  augmentation  results.  If  we  associate  lengths  with  the  arcs 
equal  to  the  weights  of  the  elements  which  the  nodes  correspond  to  (positive  for 
elements  to  be  added  and  negative  for  elements  to  be  deleted),  then  the  length  of 
a  path  corresponds  to  the  incremental  weight  of  the  augmentation.  The  longest 
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path  results  in  an  optimal  augmentation.  Such  a  path  can  be  found  using  a  short- 

(K) 

est  path  algorithm  suitably  modified  to  find  longest  paths.  '  contains  no 

positive  cycles  and  so  the  algorithm  converges. 

These  augmentations  do,  indeed,  result  in  intersections.  As  one  passes  through 
nodes  from  a  to  z  we  see  that  an  element  is  added  preserving  independence  in  the 
first  matroid  but  not  the  second.  An  element  is  then  deleted  restoring  indepen¬ 
dence  in  the  second  matroid  and  hence  the  intersection.  A  node  is  then  added 
which,  because  of  the  deleted  node,  maintains  independence  in  the  first  matroid. 
This  process  continues  until  the  added  element  maintains  independence  in  the 
second  matroid  as  well  as  the  first,  thus  completing  the  augmentation. 

As  an  example,  consider  a  two  dimensional  assignment  problem  (involving,  say, 
only  people  and  jobs.)  The  W^^'s  for  this  problem  are  given  in  Figure  1.  I'  ■'  is 
clearly  11,22,  i.e.,  person  1  assigned  to  job  1,  and  person  2  assigned  lo  2 

G^^^  is  shown  in  Figure  2.  The  arc  lengths  are  shown  as  are  the  lengths  of  the 
longest  paths  to  each  node  from  node  a.  The  longest  a  to  z  path  is  a, 11, 12, 22, 23, z 
which  corresponds  to  deleting  11  and  22  from  the  intersection  and  adding  31,12, 
and  23.  The  length  of  this  path.  7,  is  the  difference  between  the  weight  of  P  * 
and  A  complete  description  of  this  process  and  a  proof  of  its  validity  is 

given  in  IS). 

Generalized  Auqumenting  Paths 

In  the  graph  shown  in  Figure  2,  one  can  obtain  an  optimal  augmentation  (i.e.,  one 
which  takes  us  from  an  optimal  assignment  of  K  elements  to  an  optimal  assignment 
of  K  1)  because: 

1.  If  the  current  intersection  is  not  maximal  then  an  augmenting  path  exists. 

2.  The  labels  given  to  the  nodes  during  the  longest  path  algorithm  completely 
summarize  the  augmenting  paths. 

We  now  wish  to  generalize  the  notion  of  an  augmenting  path,  and  hence  the  entire 
procedure,  to  the  problem  of  the  intersection  of  three  matroids.  One  way  of  doing 
this  is  to  "freeze"  one  of  the  matroids  and  only  consider  alternating  sequences 
within  the  other  two.  In  this  case  the  first  node,  S|  in  an  augmenting  path  would 
be  independent  of  1^^^  in  two  of  the  three  matroids  (or  in  all  three,  in  which  case 
it  is  the  only  node  in  the  augmenting  path).  Say  S|  is  independent  of  I  '  in  the 
first  and  third  matroids.  We  could  then  freeze  the  third  matroid  and  maintain  the 
same  span  within  the  third  matroid  throughout  the  augmenting  path.  Thus,  the 
deletion  of  Sj  for  i  even  reduces  this  span  and  the  addition  of  s.  for  i  odd  restores 
it.  We  thus  reduce  the  search  space  to  two  matroids  and  the  same  polynomial 
bounded  procedure  will  work.  Note  that,  alternatively,  we  could  have  considered 
the  first  matroid  frozen.  Indeed,  it  is  so  frozen  in  the  two  matroid  intersection 
algorithm.  Thus,  there  are  three  types  of  augmenting  paths,  one  for  each  matroid 


within  which  S|  is  dependent.  Unfortunately,  while  this  procedure  is  polynomial 
bounded,  it  does  not  guarantee  optimal  solutions  as  there  are  augmentations  which 
have  no  such  corresponding  argumenting  path. 


In  order  to  guarantee  that  all  augmentations  are  explored,  we  must  relax  the 
definition  of  an  augmenting  path  still  further  to  include  cases  where  independence 
is  not  necessarily  restored  by  the  deletion  of  s^  for  i  even.  Thus,  an  augmenting 
path  may  start  with  any  element,  s,,  which  is  independent  of  P  '  in  at  least  one 
of  the  matroids.  Unlike  the  procedure  given  for  two  matroids,  one  may  begin  with 
independence  in  any  matroid.  Consider  the  graph  shown  in  Figure  3  corresponding 
to  two  augmenting  paths.  Path  1  and  Path  2,  for  the  partial  assignment  111,222,333 
(i.e.,  person  1  to  job  1  on  day  1,  etc.)  in  a  TDAP.  These  paths  arc  not  strictly 
comparable  in  that  they  exclude  different  elements  along  the  way.  Thus  in  Figure 
2,  when  node  22  is  labeled  using  the  path  a, 32, 22  it  is  equivalent  (in  terms  of  how 
the  path  can  continue,  not  necessarily  in  terms  of  the  numerical  value  of  the  label) 
to  being  labeled  using  the  path  a, 31, 11, 12, 22.  In  Figure  3,  however,  when  node 
111  is  labeled  using  the  path  a, 411, 111  it  is  different  from  labeling  111  using  the 
path  a, 154,111  because  different  continuations  of  these  paths  are  possible.  Thus 
starting  with  a, 411, 111  we  can  continue  to  152  but  not  215  and,  conversely,  start¬ 
ing  with  a, 154, 111  we  can  continue  with  215  but  not  152.  Thus,  Path  1  and  Path 
2  are  not  comparable  in  terms  of  their  lengths  only. 

Such  paths  must  also  be  compared  in  terms  of  their  spans.  We  note  that  if  two 
paths  from  a  to  some  node  i  result  in  sets  having  identical  spans  then  the  same 
continuations  of  both  paths  are  possible.  (This  was  the  case  for  intersections  of 
two  matroids.)  Indeed,  it  is  possible  for  paths  to  have  slightly  different  spans 
and  still  have  the  same  set  of  possible  continuations.  In  particular,  if  the  only 
difference  in  the  intersections  of  the  spans  of  two  paths  are  nodes  outside  the 
intersection  of  the  spans  of  I^*^\  then  the  paths  are  comparable.  We  can  thus 
generalize  the  augmenting  path  procedure  to  consider  all  undominated  a  to  ?.  paths 
where  one  path  dominates  another  only  if  it  has  the  same  continuations  and  a 
larger  length. 


The  notion  of  a  path  itself,  however,  must  be  generalized  as  well.  In  the  case  of 
3  matroids,  not  all  augmentations  correspond  to  paths.  We  see  an  example  of  this 
for  a  TDAP.  The  augmentation  (412,234,341,123)  -  (111,222,333)  does  not  cor¬ 
respond  to  any  path  in  the  conventional  sense.  It  is  possible  however,  to  extend 
the  augmenting  path  procedure  to  include  such  augmentations  by  extending  the 
notion  of  a  path. 


We  define  a  generalized  augmenting  path  with  respect  to  an  intersection  I^*^^  to  be 
a  sequence  of  nodes  S  =  (s,,S2,  . 

1^*^^  for  even  i.  As  before,  I^*^^  ♦ 


•  s_)  where  s.  e  E  -  1 
tn  ) 

ll  -  Sj  ♦  Sj  - 


(K) 


for  odd 


and  S|  c 
Also, 


♦  s_  is  an  intersection, 
m 

the  even  s.  are  deleted  in  order  to  remove  dependencies  created  by  the  inclusion 
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of  the  odd  Sj.  Now,  however,  the  subsequences  1^*^^  +  s,  -  Sj  ♦  for  even  j 

need  not  correspond  to  intersections. 

One  can  thus  guarantee  an  optimal  intersection  as  in  the  case  of  twu  matroids.  The 
number  of  generalized  augmenting  paths  one  may  need  to  consider,  however,  may 
grow  exponentially  with  K.  In  practice,  however,  the  number  of  such  paths  can 
be  controlled  at  the  expense  of  optimality.  First,  the  length  of  any  path,  (s,,  s^, 

Sj),  can  be  reduced  by  a  penalty  to  account  for  the  nodes  which  still  must  be 
deleted  to  restore  the  intersection.  In  the  case  of  arbitrary  matroids,  this  may  be 
complex  to  compute.  In  the  case  of  the  TDAP,  however,  where  3  partition 
matroids  are  Involved,  and  all  cycles  contain  2  elements,  it  is  easily  computed. 

In  some  cases  the  above  may  keep  the  computations  reasonable.  In  others,  it  may 
be  necessary  to  reduce  the  number  of  paths  considered  by  relaxing  the  definition 
of  dominance.  This  will  also  result  in  a  heuristic  rather  than  an  optimal  solution. 
In  the  case  of  the  TDAP,  one  such  relaxation  is  to  ignore  differences  in  the  spans 
outside  the  intersection  of  the  span  of  r  ' .  This  is  motivated  by  the  fact  that 
we  consider  deleting  elements  in  1^*^^  in  order  to  include  elements  blocked  by 
them. 

Wc  can  thus  consider  a  hierarchy  of  generalized  augmenting  path  procedures  with 
increasingly  stringent  dominance  criteria  and  increasing  runtime.  A  tradeoff 
between  optimality  and  runtime  is  then  available.  We  are  currently  investigating 
this  tradeoff  using  the  TDAP  as  an  example. 
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Two  shortest  path  algorithms  are  compared  and  it  is  shown  that,  while  one  outper¬ 
forms  the  other  in  practice,  the  former's  running  time  is  exponential  in  the  worst  case 
while  the  latter’s  is  polynomial.  A  procedure  which  constructs  such  worst  case  ex¬ 
amples  IS  given. 


In  the  excellent  paper  “A  Computational  Analysis  of  Alternate  Algorithms  and 
Labeling  Techniques  for  Finding  Shortest  Path  Trees”  by  Dial,  Glover,  Karney,  and 
Klingman  1 1  ]  the  statement  is  made  “that  the  label-setting  algorithm  has  a  worst  case 
bound,  that  is  an  order  of  magnitude  better  than  the  label-correcting  algorithm.”  It  is, 
in  fact,  easily  shown  that  this  statement  is  true  for  the  performance  of  Algorithm  Cl 
relative  to  the  performance  of  a  label  setting  method.  In  Algorithm  Cl,  since  the 
sequence  list  is  managed  in  a  FIFO  fashion,  paths  ate  generated  in  order  of  the  number 
of  arcs  in  them.  Each  node  can  thus  be  scanned  at  most  iV  times  (creating  paths  with 
K  arcs,  A  =  1 .... ,  N),  where  N  is  the  number  of  nodes.  Since  nodes  are  scanned  only 
once  in  label  setting  algorithms,  the  result  follows. 

The  situation  is  surprisingly  different  with  respect  to  Algorithm  C2.  I  have  used  this 
algorithm  to  find  routes  in  very  large,  very  sparse  real  networks  (thousands  of  nodes 
and  average  nodal  degree  between  2  and  3)  with  a  variety  of  length  functions  (gen¬ 
erally  distance-related)  and  have  found  it  to  outperform  all  others.  This  is  exactly 
as  the  authors  indicated.  The  worst  case  behavior  of  this  algorithm,  however,  is 
exponential'. 

Consider,  for  example,  the  network  in  Figure  1.  Suppose  that  the  list  of  nodes  adja¬ 
cent  to  node  1  (the  root)  is  in  ascending  order  of  node  number  and  that  the  list  of 
nodes  adjacent  to  all  other  nodes  is  in  descending  order.  We  find  that  every  time  a 
node  is  scanned,  node  2  is  relabeled  and  scanned.  In  fact,  node  2  takes  all  labels  be¬ 
tween  20  and  5  and  is  scanned  every  time  it  is  relabeled.  Here  Af  =  6  and  node  2  is 
scanned  2^"’  times.  Networks  with  this  characteristic  and  of  arbitrary  size  can  be 
formed  as  follows. 

Step  I:  Start  with  a  network  with  two  nodes.  1  and  2.  and  two  arcs  (1.  2)  and 
(2,  I)  with  /,(1.  2)  =  I  and  L(l,  1)  =  I.  where  L(i,  j)  is  the  length  of  the 
arc  from  /  to  /. 
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Step  2:  For  K-Z  toN  alter  the  network  as  lollows: 

(a)  Add  node  K. 

(b)  Add  arcs  (K,  r)  for  i  =  2 . K-  1  with  /.(A.  i )  =  Z.(l.  /) 

(c)  Set/,(l,/)  =  L(l.r)+2^-^  +  1. 

(d)  Add  an  arc  (1,  A)  with  f,(l.  A)  =  I. 

During  the  execution  of  algorithm  C2.  the  node  2  lakes  all  labels  from  A'-  1  to 
2^-1  +  Af-  2  and  is  scanned  2^’’  times. 

It  should  be  noted  that  this  pathological  situation  is  a  product  not  only  of  the  un¬ 
usual  arc  lengths  but  also  of  the  ordering  of  the  adjacency  lists,  in  particular  that  lists 
of  adjacent  nodes  are  ordered  with  the  nearest  node  first  for  all  nodes  except  the  root. 
If  the  adjacency  lists  were  ordered  oppositely,  each  node  would  be  scanned  only  once. 
Thus,  we  see  a  strong  dependence  of  the  performance  of  Algorithm  C2  on  a  preorder¬ 
ing  of  the  links.  The  general  idea  would  be  to  order  links  in  such  a  way  that  they 
would  be  likely  to  be  scanned  in  ascending  order  of  their  labels.  Alternatively,  adja¬ 
cency  lists  could  be  ordered  nearest  nude  first  and  the  sequence  list  management  could 
be  altered  only  slightly  to  add  the  entire  list  of  relabeled  nodes  to  top  of  the  sequence 
list  in  forward  order  (rather  than  the  reverse  order  implied  in  Algorithm  C2)  of  the 
adjacency  list  of  the  node  being  scanned. 

In  summary,  we  note  that  Algorithm  CZ  has  excellent  behavior  in  practice  and  pre¬ 
processing  is  probably  warranted  only  if  m.iliiple  applications  of  the  algorithm  are 
being  done  (e.g..  from  multiple  sources). 
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HD.  Probabalistic  Analysis  of  Algorithm  Performance 

The  problem  of  finding  an  optimal  set  of  repeater  locations 
covering  a  given  set  of  terminal  sites,  or  set  of  potential  repeater 
locations  and  a  covering  matrix  specifying  which  terminal  sites  can  be 
covered  by  each  repealer  location,  is  an  important  problem  in  the 
design  of  multihop  packet  radio  networks.  The  problem  in  an  instance 
of  the  classic  set  covering  problem  which  has  many  other  applications 
as  well.  The  set  covering  problem  is  known  to  be  NP-completc  and 
as  such  it  is  unlikely  that  an  algorithm  will  be  found  which  can 
guarantee  an  optimal  solution  and  also  guarantee  a  reasonable  running 
time,  that  is  a  runtime  which  grows  polynomially  in  the  number  of 
terminals  and  repeaters. 

The  theory  of  nondeterministic  polynomial  completeness  (NP- 
completeness)  basically  states  that  there  is  a  large  class  of  problems, 
which  includes  almost  all  optimization  problems  of  interest  in  the  area 
of  network  design,  which  have  the  property  that  a  solution  to  one 
could  be  used  to  obtain  a  solution  to  all  the  others  in  a  reasonable 
amount  of  time.  It  has  been  shown  that  one  problem,  the  Satisfiability 
Problem,  has  the  property  that  all  problems  in  this  class  can  bt 
transformed  into  an  instance  of  it.  Thus,  an  algorithm  which  could 
obtain  an  optimal  solution  to  the  Satisfiability  Problem,  could  be  used 
to  solve  all  the  others.  Techniques  have  been  developed  for  proving 
that  many  other  problems  are  NP-complete.  Proofs  have  been  given 
for  the  NP-completeness  of  over  200  other  well-known  problems. 

In  our  previous  work  we  reported  on  results  obtained  using  a 
new  set  covering  algorithm.  The  algorithm  obtained  optimal  solulions 
for  moderate  sized  problems  with  up  to  300  node  networks  and  cover- 
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age  related  to  distance.  That  is,  if  the  average  matrix  reflected  the 
fact  that  a  repeater  was  far  more  likely  to  cover  a  nearby  terminal 
than  one  farther  away,  then  the  algorithm  worked  very  well,  con¬ 
verging  to  an  optimal  solution  very  quickly. 

If  on  the  other  hand,  the  coverage  matrix  was  random,  the 
algorithm  ran  much  more  slowly  and  only  problems  with  up  to  .50 
terminals  could  be  treated  efficiently.  In  examining  why  the 
algorithm  had  difficulty  treating  problems  with  random  data,  we 
observed  that  the  difficulty  stemmed  from  the  large  number  of  optimal 
and  near  optimal  solutions  available.  The  algorithm  had  no  difficulty 
finding  an  optimal  solution  but,  rather,  had  difficulty  in  verifying  the 
solution  was  optimal  in  a  reasonable  amount  of  time  due  to  the  presence 
of  a  large  number  of  alternate  solutions  of  comparable  quality.  This 
led  us  to  conjecture  that  it  might  be  possible  to  develop  algorithms 
which  had  a  reasonable  running  time  and  which  could  find  optimal 
time  or  near-optimal  solutions  with  high  probability.  It  is  known  that 
asymptotically,  as  the  size  of  the  problem  becomes  infinite,  there 
exists  algorithms  which  give  optimal  results  almost  always.  These 
results  are  an  outgrowth  of  the  same  symmetry  which  makes  it  difficult 
to  verify  the  optimality  of  the  solutions  produced  by  our  set  covering 
algorithm. 

These  results  are  encouraging  but  of  no  direct  use  in  solving 
network  design  problems  since  real  problems  are  of  finite  size  and 
nothing  was  said  about  how  fast  the  algorithms  converge  probablis- 
tically  to  an  optimum;  i.e.,  what  the  probability  is  of  their  finding 
the  optimum,  or  a  solution  within  some  bound  of  the  optimum.  We 
thus  sought  to  investigate  what  could  be  said  about  the  probabalistir 
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performance  of  known  heuristics  for  problems  of  moderate  si/e.  We 
chose  the  set  covering  problem  mentioned  above  as  the  first  problem 
to  investigate.  The  probabalistic  model  of  this  problem  follows. 

We  are  given  a  set  T={ti,  ...  tj^ }  of  N  terminals,  a  set 


of  M  potential  repeater  sites,  and  an  ISi  x  M 


K={ri,  r2,  .. 

covering  matrix,  C,  where  c.^  is  1  if  L  can  be  covered  by  r^ .  We 
seek  a  subset  of  R  containing  as  few  repeaters  as  possible  and  cover¬ 
ing  all  terminals.  The  elements  of  C  are  chosen  randomly  and  inde¬ 
pendently.  Specifically,  we  assume  Pr{c.pl}=p  and  Pr{C.pU}=l-p=q . 
We  define  R^,  the  set  of  terminals  covered  by  r^,  by: 

R.  =  {t.|c..  =  1} 

J  ‘  i'  ij 

Similarly,  we  define  T.,  the  set  of  repeaters  covering  t. ,  by: 

Ti  -  (r.lc..  -  1} 

A  cover  of  the  terminals  is  then  defined  as  a  subset  SCR  satisfying: 


U  R.  =  T 

r.eS  ‘ 

] 

A  minimum  cover,  S*,  is  then  a  cover  containing  as  few  elements  as 
possible ,  i .  e .  : 


U  R.  =  T 

r.eS*  ^ 

J 

and  |S*|  <  |S|  for  all  S  such  that  U  R.  =  T. 

r.eS  ^ 

J 

Note  that  the  minimal  cover  is  not  in  general  unique.  Indeed 
our  approach  rests  on  the  existence  of  a  large  number  of  minimal  and 
near  minimal  covers. 

In  order  to  analyze  the  probabalistic  behavior  of  a  heuristic  we 
must  do  two  things.  First,  for  a  given  set  of  values  of  N,  M,  and 
p,  we  must  find  the  probability  that  |S*|  =  K,  that  is,  the  probability 
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that  there  exists  a  minimal  covering  containing  exactly  K  repeaters. 
Let 

P(N,M,K)  =  Pr{lS*l  =  K}  for  given  N,M,  and  p. 


Next  we  must  find  P^j^^(N  ,M,K),  defined  as  the  probability  that  the 
algorithm  will  find  a  solution  with  exactly  K  repeaters.  A  figure  of 
merit  for  evaluating  the  probabalistic  behavior  of  the  algorithm  is 


1  KP  (N,M,K)  -  1  KP(N,M,K) 

K=0  K=0 


1  KP(N,M,K) 
K=0 


(lID.l  ) 


This  is  simply  the  relative  error  made  by  the  algorithm,  i.e., 
the  difference  between  the  avarage  number  of  repeaters  in  a  covering 
found  by  the  algorithm  and  the  minimum  number  required,  normalized 
by  the  minimum  number  required.  We  believe  that  ym^  ^ALG  ^  ^ 
any  M,  and  p,  and  any  reasonable  algorithm  where  a  reasonable 
algorithm  is  defined  as  one  which  stops  when  it  has  a  cover.  In 
particular,  we  believe  ^n  algorithm  which  picks  repeaters 

randomly  until  a  covering  is  obtained. 

We  begin  by  evaluating  Q(N,M,K),  the  probability  that  a  ran¬ 
domly  chosen  set  of  K  repeaters  covers  all  N  terminals.  Notice  that 
this  is  not  the  same  as  the  probability  that  an  algorithm  picking 
repeaters  at  random  will  stop  after  K  repeaters  since  in  some  cases 
fewer  than  K  will  also  suffice  and  the  algorithm  will  find  them.  Also, 
Q(N,M,K)  is  a  cumulative  distribution,  that  is  it  relates  to  the  proba¬ 
bility  that  K  or  fewer  repeaters  are  required  for  a  covering  rather 
than  exactly  K.  We  do  have,  however,  that 
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K 

Q(N,M,K)  <  I  P(N,M,K)  (  111).2 ) 

L=0 

that  is,  the  probability  of  a  randomly  chosen  set  of  K  repealers 
covering  all  N  terminals  is  less  than  or  equal  to  the  probability  of  th(! 
existence  of  a  covering  containing  K  or  fewer  terminals.  Thus  we 
have , 

P(N,M,K)  >  Q(N,M,K)  -  Q(N,M,K-1),  K>1, 
that  is,  the  difference  between  two  successive  Q's  is  a  lower  bound 
on  the  corresponding  P. 

Q(N,M,K)  is  easy  to  evaluate  because  repeaters  picked  randomly 

are  picked  independently  of  one  another  and  our  random  model 

assumed  their  original  characteristics  are  independent.  The 

probability  of  a  single  terminal  not  being  covered  by  a  single  repeater 

is  q(=l-p).  The  probability  of  a  single  terminal  not  being  covered  by 

K 

any  of  the  K  repeaters  is  q  .  The  probability  of  at  least  one  repeater 

of  K  covering  a  given  terminal  is  then  1-q  .  Finally,  the  probability 

K  N 

that  all  N  terminals  are  covered  by  the  K  repeaters  is  (1-q  )  .  We 
thus  have  that 

Q(N,M,K)  =  (1  -  q’^)'^  (HD. 3) 

Note  that  this  is  not  a  function  of  M  except  in  the  trivial  sense 
that  M  must  be  no  smaller  than  K.  Figure  llD.l  shows  values  of 
Q(N,M,K)  for  p=.5.  Figure  IID.2  shows  values  of  Q(N,M,K)  - 
Q(N,M,K-1)  for  the  same  examples.  Both  show  a  tendency  for  solu¬ 
tions  to  cluster  about  a  narrow  range  of  K.  This  is  encouraging  in 
that  it  implies  that  probabalistically ,  simple  algorithms  should  do  well. 


CUMULATIVE  PROBABILITY  OF  COVERING, 


FIGURE  IID.l  CUMULATIVE  PROBABILITY  OF  A  COVERING  FOR 
RANDOM  ALGORITHM  (p=0.5). 


PROBABILITY  OF  A  COVERING,  Q (N , M , K) -O (N , 


IID.H 


As  yet,  we  do  not  have  an  exact  analytic  expression  tor 
l’(N,M,K).  VVe  can  get  a  tighter  lower  bound  on  it  by  evaluating  the 
distribuUon  of  solutions  from  increasingly  sophisticated  algorithms.  A 
simple  minded  algorithm  would  be  to  pick  repeaters  randomly  until  one 
obtained  a  covering.  The  probability  of  obtaining  a  solution  with  K 
or  fewer  repeaters  using  this  algorithm  is  Pi(N,M,K)  given  by; 

Pi(N,M,K)  =  1  ^  p‘g’^'*P,(N-i,  M-1,K-1)  (IID.4) 

i=0  ‘ 

th  /  N  \  i  N  “  1 

since  the  K —  repeater  will  cover  i  terminals  with  probability  ^  ^  )p  q 

and  the  remaining  K-1  of  M-1  repeater  must  cover  the  remaining  N-i 

terminals.  Note  that  this  is,  again,  not  a  function  of  M  except  that 

M  must  be  no  smaller  than  K.  The  probability  of  obtaining  a  covering 

with  exactly  K  repeaters  is  given  by  Pi(N,M,K)-P,(N,M,K-] ). 

A  somewhat  more  sophisticated  algorithm  is  to  consider  repeaters 
in  a  random  order  and  to  select  a  repeater  only  if  it  improves  the 
chances  of  obtaining  a  covering  more  by  picking  it  than  by  not 
picking  it.  Letting  P2(N,M,K)  be  the  probability  of  obtaining  a 
covering  containing  K  or  fewer  repeaters  using  this  algorithm,  we 
have 

N 

P2(N,M,K)  =  1  ^  p'q'^''{max  (P2(N-i,  M-l,K-i),  P2(N  ,M-1  ,K))} 

i=0  * 

(HD. 5) 

P2(N,M,K)  is  plotted  for  N=5  and  L-IO  in  Figure  I1D.3.  Along  with 
it,  the  probability  of  obtaining  a  covering  using  K  randomly  chosen 
repeaters  is  plotted.  Notice  that  this  algorithm's  performance  is  a 
substantial  improvement  over  randomly  picking  repeaters.  Notice 
also,  that  P2(N,1V1,K)  is  a  function  of  M.  This  algorithm  is  very 
simple  and  has  a  reasonable  running  time  even  for  very  large  N  and 


C.  PROBABILISTIC  ANALYSIS  OF  ALGORITHMS 

We  have  obtained  results  in  this  area  indicating  that  even  rela¬ 
tively  simple  algorithms  will,  with  high  probability,  obtain  near  optimal 
solutions  to  certain  difficult  optimization  problems.  This  is  encouraging 
in  that  it  implies  that  problems,  previously  considered  intractable,  may 
in  fact  be  solvable,  at  least  with  high  expectation  of  success.  The 
results  are  also  useful  in  guiding  heuristics  towards  potentially  fruitful 
areas  within  the  solution  space  and  in  avoiding  unnecessary  effort  in 
areas  where  little  further  progress  can  be  expected. 


Specifically,  we  have  analyzed  the  intrinsic  probabilistic  behavior 
of  the  3-Satisfiability  Problem  and  found  that,  independent  of  the  size 
of  the  problem,  there  is  only  a  limited  region  of  uncertainty,  i.e.,  only 
a  finite  set  of  values  where  even  the  simplest  sensible  algorithm  can 
make  an  error.  This  problem,  which  is  described  in  detail  below,  is 
representative  of  a  very  large  class  of  difficult  combinatorial  problems 
and  can,  in  fact,  be  used  as  a  vehicle  for  solving  many  other  problems. 
Thus,  we  believe  these  results  can  be  extended  to  other  problems  as 
well,  most  notably,  to  problems  in  network  design  such  as  facility 
location  and  link  topology  optimization.  We  are  currently  in  the  process 
of  investigating  such  extensions  and  of  examining  the  probabilistic 
performance  of  several  specific  heuristic  algorithms. 

C.l.  Probabilistic  Behavior  of  the  3-Satisfiability  Problem 

The  3- Satisfiability  Problem  has  a  simple  and  homogeneous  struc¬ 
ture  which  makes  it  easy  to  develop  and  study  a  probabilistic  model.  It 
also  does  not  involve  any  numerical  data  which  would  otherwise  obscure 
the  nature  and  generality  of  the  results  which  we  obtain.  These  re¬ 
sults  can,  however,  be  directly  extended  to  optimization  problems  which 
do  involve  numerical  data,  as  we  will  see  in  the  following  discussion. 

In  the  ordinary  satisfiability  problem,  we  are  given  a  Boolean 
expression,  E,  over  m  Boolean  variables  Vi,V2,...,v^  and  we  ask  if 
there  exists  a  set  of  truth  values  for  the  variables  which  will  result  in 
E  taking  the  value  True.  Thus,  each  of  the  v.  can  take  either  the 
value  True  or  False  (alternatively  denoted  by  1  or  0)  and  E  will  then 
take  either  the  value  True  or  False  based  on  the  values  of  the  v^.  E  is 
usually  given  in  disjunctive  normal  form,  i.e.,  as  a  set  of  clauses  all  of 
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which  must  be  True  in  order  for  E  to  be  true.  Each  clause  contains 
one  or  more  variables  and  is  said  to  be  true  if  at  least  one  of  the 
variables  in  the  clause  is  assigned  the  value  it  takes  in  the  clause. 
Thus  for  example,  E  =  (vj  +  V2)(Vi  +  V3)  has  2  clauses.  The  first, 
(Vj  +  V2)  is  true  if  either  Vj  is  True  or  V2  is  True.  The  second  is 
True  if  either  Vi  is  False  or  V3  is  True.  Thus  E  is  satisfiable  as  the 
values  Vi  =  True,  V2  =  True,  and  V3  =  True  satisfy  both  clauses  and 
hence  E.  There  are  several  other  assignments  of  truth  values 
and  V3  which  will  satisfy  this  E.  The  expression  (vj)(vi  +  V2)(V2)  on 
the  other  hand  is  not  satisfiable. 

The  3-Satisfiability  Problem  is  a  version  of  ihe  ordinary  Satisfia¬ 
bility  Problem  where  all  clauses  contain  exactly  3  variables.  Garey  and 
Johnson,  in  their  book  Computers  and  Intractability,  show  this  problem 
is  NP-complete  by  showing  that  any  ordinary  Satisfiability  Problem  can 
be  transformed  into  a  corresponding  3-Satisfiability  Problem  of  roughly 
the  same  size.  Thus  the  two  problems  are  equivalent. 

Many  other  problems  can  also  be  transformed  into  corresponding 
3-Satisfiability  Problems.  In  particular,  combinatorial  optimization 
problems  such  as  the  Traveling  Salesman  Problem  or  the  problem  of 
locating  earth  stations  in  a  satellite  network  can  be  so  transformed. 
Thus,  we  can  produce  a  Boolean  expression  which  corresponds  to  a 
particular  optimization  problem  in  the  sense  that  if  the  expression  is 
satisfiable  then  the  optimization  problem  has  a  solution  with  a  value  less 
than  or  equal  to  a  given  constant  (for  minimization  problems)  or  greater 
than  or  equal  to  a  given  value  (for  maximization  problems).  Further¬ 
more,  the  satisfying  truth  assignment  can  be  used  to  obtain  the  solution 
to  the  corresponding  optimization  problem  directly.  By  altering  the 
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value  of  the  constant  in  the  above  transformation  the  optimal  solution  to 
the  corresponding  optimization  problem  can  be  found  via  binary  search; 
i.e.,  if  we  know  the  optimum  lies  between  values  Cj  and  C2,  we  try  the 
value  Cl  +  C2/2.  This  technique  is  known  as  thresholding. 

One  can  determine  if  an  expression  is  satisfiable  by  assigning  all 
z”’  possible  truth  values  to  its  variables.  This  approach  is,  of  course, 
not  practical  for  large  values  of  m.  There  are  many  sophisticated  tech¬ 
niques  for  answering  the  question  whether  or  not  a  given  expression  is 
satisfiable,  but  all  have  running  times  which  ultimately  grow  exponen¬ 
tially  with  the  number  of  variables  and  thus  are  limited  to  problems  of 
modest  size. 

We  obtain  here  a  technique  which  yields  the  a  priori  probability  of 
an  expression  with  a  given  structure  being  satisfiable  given  a  probabil¬ 
istic  model  of  the  space  from  which  the  problem  is  drawn.  For  prob¬ 
lems  where  this  probability,  P  ,  is  very  close  to  1  or  very  close  to 
zero,  we  need  seek  no  further.  Only  for  problems  where  is  signifi¬ 
cantly  different  from  0  or  1  need  the  question  be  investigated  any 
further.  In  some  cases,  the  P^  itself  is  all  that  we  need.  For  exam¬ 
ple,  if  we  have  found,  using  a  heuristic  algorithm,  a  solution  of  value 
Cl  to  a  given  maximization  problem,  and  have  determined  that  P^  is  less 
than  .01  for  an  expression  corresponding  to  the  existence  of  a  solution 
of  value  greater  than  or  equal  to  Ci  +  a  (for  a  much  smaller  than  Cj), 
then  we  are  reasonably  certain  that  there  is  not  much  to  be  gained  from 
an  attempt  at  further  optimization.  This  is  very  important  because  it 
has  often  turned  out  that  it  is  much  harder  to  verify  the  optimality  (or 
near-optimality)  of  a  solution  than  to  find  an  optimal  solution  especially 
when  the  solution  space  is  rich  and  there  exist  many  alternate  near- 
optimal  solutions. 
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We  consider  a  uniform  probabilistic  model  for  the  3-Satisfiability 
Problem  where  all  clauses  are  equally  likely.  Thus  a  problem  is  totally 
characterized  by  v  and  b,  the  number  of  variables  and  clauses,  respec¬ 
tively.  We  will  consider  two  models  which  are  nearly  equivalent.  In 
one  case,  clauses  are  picked  without  replacement,  i.e.,  a  given  clause 
can  appear  at  most  once  in  an  expression.  In  the  other  case,  clauses 
are  picked  with  replacement. 

Duplicate  clauses  do  not  affect  the  satisfiability  of  an  expression . 
We  can  thus  relate  problems  where  the  clauses  are  chosen  with  replace¬ 
ment  to  problems  where  the  clauses  are  chosen  without  replacement  by 
saying  that  a  problem  with  v  variables  and  b  clauses  chosen  with  re¬ 
placement  is  the  same  as  a  problem  with  v  variables  and  b '  clauses 
chosen  without  replacement  if  the  number  of  distinct  clauses  in  the  first 
problem  equals  b'.  The  relationship  between  the  t. umber  of  clauses 
chosen  with  replacement,  b,  and  the  average  number  of  distinct  clauses 
is  given  by  the  following  argument. 

Let  be  the  number  of  distinct  clauses  when  b  clauses  are 
selected  with  replacement.  Then  <  b  and  =  X^  ^  +  a,  where 

0  =  1  with  probability  ,  and  o  =  0  with  probability  1  -  P  .  P 

new  ^  ^  new  new 

is  the  probability  that  the  b^^  clause  is  different  from  all  the  first 
b  -  1.  Pfiew  simply  the  ratio  of  the  number  of  unchosen  clauses  to 

the  total  number  of  possible  clauses.  The  number  of  possible  clauses  is 

B  =  8(^) 

since  each  clause  contains  3  variables  each  of  which  can  take  either  of 

two  values.  Thus  P _ _  is  given  by; 

new  ^  ^ 


P 


new 


(C-1) 
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Let  f(b)  be  the  average  number  of  distinct  clauses.  Then 

E(XblXb.i)  =  (C-2) 

and 

f(b)  =  E(Xj^)  =  f(b  -  1)  +  ^  ~  .  :C-3) 

Thus 

f(b)  =  f(b-l)  X  ^  +  1  .  (C-4) 

As  an  example,  if  v  =  10,  then  B  =  960  and  f(50)  =  48.7.  As  we  will 
see,  we  are  most  interested  in  values  of  b  near  5v.  For  v  =  20  and 
b  =  100,  f  =  99.5.  As  V  increases,  f/b  approaches  1  very  quickly. 
Thus,  there  is  little  difference  between  choosing  clauses  with  or  without 
replacement. 

C.2.  Analytical  Method  1 

We  now  turn  to  the  problem  of  estimating  Pg(v,  b),  the  probability 
of  expression  E,  with  v  variables  and  b  clauses,  being  satisfiable. 
Given  a  problem  with  v  variables  and  b  clauses,  we  define  the  truth 
assignment  graph,  G,  associated  with  that  problem  to  be  a  graph  with 
2^^  +  1  nodes.  Nodes  1  through  2^  correspond  to  the  possible  truth 
values  for  the  v  variables  and  node  0  is  a  distinguished  pseudonode. 
Each  clause  in  E  is  inconsistent  with  one  eighth  of  the  truth  assign¬ 
ments,  specifically  those  in  which  all  three  variables  in  the  clause  have 
values  opposite  to  their  values  in  the  clause.  G  contains  an  arc  be¬ 
tween  node  0  and  any  node  corresponding  to  a  truth  assignment  which 
does  not  satisfy  E.  We  thus  proceed  through  E  clause  by  clause  and 
add  arcs  between  node  0  and  nodes  which  do  not  satisfy  the  clause. 
There  are  2'^”^  arcs  corresponding  to  each  clause.  In  general,  arcs 
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"belonging"  to  different  clauses  overlap,  i.e.,  there  will  be  truth 
values  inconsistent  with  more  than  one  clause.  We  do  not  show  the 
multiplicity  of  these  arcs.  Thus,  G  will  in  general  contain  fewer  than 
b  X  2'^"^  arcs.  We  are  interested  in  G  because  P^,  the  probability  E  is 
satisfiable,  is  precisely  equal  to  the  probability  that  G  is  not  con¬ 
nected.  Figure  C.l  shows  a  truth  assignment  graph. 

The  problem  of  whether  a  random  graph  is  connected  or  not  has 
been  studied  extensively  by  Erdos  and  Renii.  We  will  proceed  along 
similar  lines. 


E  =  (Vi  +  V2  +  V3)  (Vi  +  V2  +  V4)  (vg  +  V3  +  V4) 


FIGURE  C.l.  A  TRUTH  ASSIGNMENT  GRAPH 


We  define  A(v,  k,  b)  to  be  the  number  of  Boolean  expressions 
whose  corresponding  truth  assignment  graphs  contain  k  or  more  isolated 
nodes  (i.e.,  nodes  with  no  incident  links).  Pg(v,  b)  is  then  given  by 


Pgfv,  b)  =  1 


2'^ 

k=0 


A(v,  k,  b) 


(C-5) 


where  B  =  SC^)- 

This  relation  follows  from  the  fact  that  if  an  expression  is  satis- 
fiable  then  its  corresponding  truth  assignment  graph  must  contain  at 
least  one  isolated  node.  Note  that  the  alternating  sum  is  required  in 
order  to  account  for  the  situations  where  graphs  with  more  than  k 
isolated  points  are  included  in  terms  with  k  or  more  isolated  points.  We 
seek  the  number  of  expressions  with  graphs  with  one  or  more  isolated 
points.  The  alternating  sum  gives  us  the  number  with  exactly  zero 
isolated  points. 

We  can  write  A(v,  k,  b)  as 

V 

A(v,  k,  b)  =  I  (7)  2*  NM(v,  i,  k)  B(v,  i,  k,  b)  (C-6) 

i=0  ‘ 

where  NM(v,  i,  k)  is  the  number  of  ways  of  selecting  k  clauses  which 
match  in  i  specific  variables  (i.e.,  k  clauses  in  which  i  specific  vari¬ 
ables  take  the  same  truth  value  in  all  k  clauses)  and  B(v,  i,  k,  b)  is 
the  number  of  Boolean  expressions  whose  corresponding  truth  assign¬ 
ment  graphs  have  k  or  more  isolated  points  and  the  isolated  points 
match  in  i  variables. 


NM(v,  i,  k)  =  'z*  (-1)"’  ‘  2^  =  h  (2^  - 

m=0  k  m 


(C-7) 


where  the  approximation  is  made  by  replacing 

.,v-i-m  ^,v-i-mvk 
(  W  )  by  - 


Now,  B(v,  i,  k,  b)  =  )  where  c(v,  i,  k)  is  the  total 

number  of  clauses  which  are  permissible  in  the  sense  that  they  give 
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rise  to  graphs  that  have  the  requisite  number  of  isolated  points  and 
matched  variables.  c(v,  i,  k)  can  be  estimated  by 

c(v,  i,  k)  =  8(3)  exp[-^ 

^  (2  -2)8v 

The  details  of  this  approximation,  which  are  somewhat  lengthy,  are 
given  in  W.  Chuang's  thesis. 

Using  the  above  approximations  and  Sterling's  approximation 
n!  s  V2rE  e'^ 

we  find  by  algebraic  manipulation  that 

P  (v,  b)  s  1  -  exp[-exp  (-c)l  (C-9) 

O 

where  b  =  8(iln  2^  +  c) 
and  c  is  a  constant. 

Thus,  only  for  b  =  5.5  v  does  P_(v,  b)  take  values  significantly 

O 

different  from  0  or  1  since  for  any  value  of  c  less  than  -2,  P^  is  nearly 
1  and  for  any  value  of  c  greater  than  +4,  P  is  nearly  0.  Figure  C.2 
summarizes  the  relationship  between  P  and  c.  Note  that  this  is  not  a 

o 

function  of  v  or  b  directly.  The  approximations  used  in  obtaining  this 
analytic  form  are,  however,  functions  of  v  as  we  will  see.  The  approx¬ 
imation  turns  out  to  be  reasonable  for  modest  values  of  v  (e.g.,  v  in 
the  range  10  to  30)  which  we  are  interested  in.  Note  especially  the 
sharp  drop  from  1  to  0  over  the  interval  c  =  -2  to  c  =  +4. 

C.3.  Analytic  Method  2 

Because  of  the  approximations  made  in  the  previous  analysis,  we 
sought  a  second  method  to  independently  verify  the  approach.  We 
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consider  the  case  now  where  clauses  are  selected  with  replacement.  We 
note  that  all  clauses  are  equally  likely  and  that  a  given  clause  elimi¬ 
nates  1/8  of  the  possible  truth  assignments.  Indeed,  because  of  this 
symmetry,  on  the  average  the  b^^  clause  in  an  expression  eliminates  1/8 
of  the  remaining  truth  assignments  which  satisfy  the  first  b-1  clauses. 
We  say  a  truth  assignment  is  eliminated  if  it  does  not  satisfy  an  expres¬ 
sion.  Thus,  if  we  define  N  (v,  b)  as  the  number  of  truth  assignments 
satisfying  an  expression  with  v  variables  and  b  clauses,  we  have  the 
recurrence  relation 

v-3  2''  -  NgCv,  b-1)  3 

N^(v,  b>  *  Ng(v,  b-1)  -  2''  t  -  2'' 

2 

=  I  Ng(v,  b  -  1)  =  (|  )^  Ng(v,  0)  (C-10) 

where  N  is  the  expected  value  of  N  . 
s  s 

Since  N^fv,  0)  =  2'^,  i.e.,  all  truth  assignments  satisfy  an  expres¬ 
sion  with  no  clauses,  we  have 


N^(v,  b)  =  (|)^  2'^ 


(C-11) 


P  (v,  b)  is  by  definition  the  probability  that  N  (v,  b)  is  greater 
s  s 

than  zero.  Unfortunately,  we  only  have  an  expression  for  N  ,  the 
average  value  of  N  and  not  for  the  value  of  N  itself. 

o  o 

If  N  (v,  b)  »  1,  then  we  would  expect  P  (v,  b)  s  1.  If  N  (v,  b)  <  1, 
then  we  may  assume  that  N  (v,  b)  is  either  0  or  1.  Then 

O 

P-Cv,  b)  =  P(N  (v,  b)  =  1)  =  N  (v,  b).  Thus  we  use  the  approximation 

O  O  b 

1  ,  N_(V,  b)  >  1 

Pgfv,  b)  =  ®  (C-12) 

NgCv,  b),  Ng(v.  b)  <  1  . 
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This  function  is  very  similar  to  the  one  obtained  by  the  first 
analytic  method  and  again  the  region  over  which  Pg(v,  b)  drops  effec¬ 
tively  from  1  to  0  is  limited  again  to  a  small  range  and  is  independent 
of  V  and  b.  Figure  C.3  illustrates  the  relationship  between  P  and  c 
(where  c  =  )• 

The  dotted  line  of  Fig.  C.3  shows  P  as  found  by  Method  1  for 
the  same  values  of  c.  As  can  be  seen,  for  c  >  1  the  curves  are  vir¬ 
tually  identical.  There  is,  however,  a  5%  difference  between  the  con¬ 
stants  relating  v  and  b  in  the  two  approximations.  Nevertheless,  the 
two  analyses  corroborate  eath  other  and  give  rise  to  the  same  analytical 
behavior  of  P  ;  i.e.,  v  and  b  are  linearly  related  at  the  point  where  P 
drops  and  P^  decays  exponentially. 


C.4.  Simulation 

Finally,  in  order  to  compare  the  accuracy  of  the  two  analytic 

methods,  we  performed  a  simulation  to  measure  P  (v,  b)  directly.  In 

s 

the  simulation,  clauses  were  generated  at  random  with  replacement,  all 
clauses  being  equally  likely.  Each  time  a  clause  was  generated,  the 
truth  values  it  eliminated  were  eliminated.  This  procedure  was  con- 
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FIGURE  C.3.  A  COMPARISON  OF  THE  FIRST  AND  SECOND  METHODS 

FOR  Pg(v,  b) 
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tinued  until  no  truth  values  remained.  For  each  value  of  v,  the  value 
of  b  at  which  the  last  remaining  truth  value  was  eliminated  was  re¬ 
corded  and  became  a  sample,  b. .  An  estimate  of  P  (v,  b)  is  then 

1  s 

formed  by 

(#  of  b.  >  b) 

=  (total  #  of  b.) 

Figure  C.4  gives  the  results  of  this  simulation  for  v  =  10,  20  and  30. 
As  can  be  seen,  the  true  value  of  Pg(v,  b)  lies  between  the  N  values 
predicted  by  the  two  analytic  methods  and  is  somewhat  closer  to  that 
predicted  by  the  second  method.  The  important  property  of  rapid 
decay  from  1  to  0  is  borne  out. 


FIGURE  C.4  COMPARISON  OF  SIMULATION  AND  TWO  ANALYTICAL  METHODS 
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Second-Order  Greedy  Ali(oritlini«  for  Ceniralired 
Teleprocessing  Network  Design 

A.  KERSHENBAUM,  Ml  MUI.H.  Il  I  k,  H  BDOKSIVS. 
MEMBER,  II  l,i:,  AND  R.  OIM'I  .Mil  IM 


Abjiraci — W*  consider  lh«  prublrm  of  desicnine  a  ccnirali/ed 
Iclccomniunicalion  network  comprised  of  multipoint  lines  K'seii  a  set 
of  terminal  locations,  trafTic  requirements,  and  a  common  central 
aiic.  The  optimal  solution  to  this  prublrm  is  a  capacitated  minimal 
spannins  tree-  We  develop  a  class  of  heuristic  alenrithms  for  the 
tolution  of  this  problem  by  imbeddinx  esistinx  heuristics,  referred  to 
aa  first-order  greedy  algorithms,  inside  a  loop  where  smalt,  carefully 
chosen  sets  of  arcs  are  alternately  forced  in  and  out  of  the  solution. 
The  resultant  procedure  is  shown  to  he  superior  to  esisting  tech¬ 
niques,  producing  solutions  typically  2  percent  better,  while  requiring 
only  a  modest  amount  of  additional  computer  time. 

INTRODUCTION 

The  problem  considered  is  that  of  finding  an  optimal 
(minimum  cost)  design  for  a  centralized  telecommunication 
network  given  a  set  of  terminal  locations,  traffic  magnitudes 
between  these  locations,  and  single  common  source  or  destina¬ 
tion  (central  site).  In  order  to  retain  simplicity  and  low  cost  in 
the  terminal  hardware,  such  networks  are  configured  as  trees 
comprised  of  communication  facilities  of  a  single  capacity. 
Thus,  the  optimal  solution  to  this  problem  is  a  capacitated 
minimal  spanning  tree  (CMST),  i.e.,  a  tree  of  minimum  total 
length  satisfying  a  constraint  or  set  of  constraints  that  limit 
the  total  traffic  and/or  number  of  nodes  in  any  subtree  rooted 
at  the  central  site.  In  centralized  networks,  such  subtrees  are 
called  multipoint  lines. 

More  formally,  we  are  given  a  set  of  N  locations  (nodes),  in 
addition  to  the  central  site ;  a  symmetric  distance  measure  D  = 
{dj/li,  /  =  0,  I,  — ,  N},  giving  the  cost  between  any  pair  of 
locations;  and  a  constraint  M  on  the  total  number  of  nodes  or 
traffic  in  a  multipoint  line.  We  seek  a  spanning  tree  T  rooted  at 
the  center,  satisfying  the  constraint,  and  of  minimum  total 
length. 

This  fonnulation  is  quite  general.  All  we  require  of  the  cost 
function  is  that  the  cost  of  a  link  dj/  not  depend  upon  what 
other  links  arc  present  in  the  solution.  The  constraint  is,  like¬ 
wise,  quite  general.  One  can,  in  fact,  have  more  than  one  con¬ 
straint,  and  the  constraints  may  be  on  total  traffic,  number  of 
nodes,  nodal  degree,  number  of  links  in  cascade,  or  any  other 
quantity  associated  with  the  design.  We  require  only  tliat  if 
some  subtree  5  does  not  satisfy  the  constraints,  then  no  other 
subtree  S',  containing  5,  satisfies  the  constraints.  We  also  as¬ 
sume  that  a  star  solution,  i.e.,  all  nodes  connected  directly  to 
the  center,  is  feasible.  This  can  always  be  satisfied  by  split¬ 
ting  a  location  into  two  or  more  nodes. 

Several  heuristics  and  optimal  techniques  have  been  devel- 
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oped  for  the  solution  of  this  problem.  The  optiiiiaJ  tecliiiio ucs 
lll-|31  are  branch  and  bound  procedures  winch  in  ycnci.U 
have  runnuig  times  winch  arc  e.itponi  r.lial  in  tlic  nuinlier  oi 
riodes.  While  they  are.  thus,  not  pracUcai  as  design  procedure-,, 
results  obtained  using  tlieni  (in  particular,  a  recently  dcvclopc  . 
procedure  [7|)  have  led  to  some  msiglu  into  the  retmemeiil  oi 
heunstic  procedures.  The  procedure  desenbed  below  u,  an  out- 
growtli  of  this  work. 

Currently,  the  most  widely  used  procedures  for  the  solution 
of  the  CMST  problem  are  heuristics  (3) ,  [4] ,  [8| ,  [10]  winch 
produce  solutions  witltin  SO  percent  of  Uic  optimum  tin  cases 
where  the  optimum  is  known)  and  wJiich  have  running  tunes 
which  are  a  low-order  polynomial,  generally  between  quadratic 
and  cubic,  in  the  number  of  nodes.  Karnaugh  [61  has  dcvc  '.oiv  1 
a  family  of  second-order  greedy  .ilgorithms  (SOG.A's)  wiu.li 
iterate  the  above  heuristics  (which  he  refers  to  as  F  (first) 
OGA’s),  and  have  longer  running  times,  but  produce  results 
generally  2-3  percent  better  than  the  above  heuristics  The 
procedures  described  below  are  variants  of  the  general 
procedures  described  by  Karnaugh,  produc*-  results  ol  com¬ 
parable  quality,  and  are  considerably  faster  than  his.  inuec,:. 
in  many  cases  their  running  times  are  compeiitive  witii  the 
simpler  procedures  (FOGA’s)  previously  used. 

PROCEDURAL  DESCRIPTION 

The  most  often  used  heuristic  solutions  to  the  CMST  prob¬ 
lem  share  the  following  properties:  1)  their  running  time  is  a 
polynomial  in  the  number  of  nodes;  2)  in  the  absence  of  con¬ 
straints,  they  will  yield  a  minimum  spannmg  tree  (MST),  and 
3)  the  quality  of  the  solution  (i.e.,  the  amount  by  which  it  dif- 
fen  from  the  optimum)  is  not  controllable  and,  except  very 
loosely,  is  not  known. 

The  basic  heuristics  used  to  solve  CMST  problems  can  c? 
divided  into  two  categories- primal  procedures  which  seek  tc 
improve  a  feasible  starting  solution  or  partial  solution,  ana 
dual  procedures  which  seek  to  make  a  low  cost  (infeasible) 
starting  solution  feasible.  We  concentrate  on  primal  proce¬ 
dures,  having  found  them  to  be  generally  more  flexible.  It 
has  been  shown  (8)  that  most  such  procedures  fall  withhi  the 
framework  of  the  following  scheme. 

1)  Start  with  each  node  on  a  separate  multipoint  line 
directly  connected  to  the  center.  Associate  a  weight  w,  with  each 
node  i  by  applying  a  given  rule  (w-rule).  For  each  potential 
Link  interconnecting  a  pair  of  nodes  i  and  /,  define  a  trade- 
olf  function  i„  as  </,,  —  w,. 

2)  Consider  Uie  (not  previously  considered)  for  which 

is  minLmuni.  if  nodes  i  and  /  are  in  separate  multipoint  lines 

and  the  subtree  formed  by  merging  these  lines  does  not  violate 
any  constraint,  tl'.en  add  to  the  network,  replacing  L,o  or 
LjQ,  whichever  is  more  costly.  If  not,  reject 

3)  Update  tlic  w,  and  f,,  and  return  to  Step  2)  until  no 
furtlier  gain  can  be  obtained.  Usually,  w,  =  w,  for  nodes 
1  and  I  in  the  same  multipoint  line. 

The  implementation  of  such  a  procedure  has  been  considered 
in  detail  [9],  A  careful  implementation  is  shown  to  be  of 
ordei  .\  ^  log2  A'  where  iV  is  the  number  of  locations  and  if  one 
wishes  to  consider  all  branches.  It  is  of  order  .V  K  logj  A'  if  one 
only  examines  the  A  nearest  neighbors  of  each  node  and  if  tlic 
w-rulc  itself  is  not  too  computationally  complex  In  practice, 
the  most  widely  used  K  -rule  is  to  set  tlic  weigiii  of  eac)i  node 
in  a  subtree  to  be  tlie  minimum  distance  between  any  node  in 
the  subtree  and  the  center.  This  is  known  as  Uie  Fsau-WUliams 
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•iBOhthm.  Karnaugh  (6]  described  an  alternate  implementa¬ 
tion  of  the  Esau-Williams  procedure  which  hat  a  different 
computational  complexity,  generally  between  quadratic  anO 
cubic,  but  which  produces  identical  results.  While  the 
SOGA's  described  below  can  work  with  any  imbedded  FOt.A, 
for  the  sake  of  comparison  with  previous  work,  we  perfornied 
experiments  with  an  imbedded  Esau-Williams  algorithnL 

In  order  to  implement  a  SOGA,  one  must  decide  which  at 
one  will  attempt  to  forte  in  or  out  of  the  solution.  Kamau(;h 
suggested  two  possibilities  which  are  briefly  deacribed  below. 
The  reader  is  referred  to  (6]  for  a  complete  deacription. 

1)  Inhibit- At  eacii  stage  in  its  execution,  the  FOGA  brings 
in  one  arc  connecting  two  previously  unconnected  sets  of 
nodes.  The  Inhibit  loop  successively  prevents  each  of  ihi* 
mergers  which  took  place  in  the  previous  iteration  from  taking 
place.  Thus,  one  iteration  of  the  Inhibit  loop  involves  up  to  A' 
(where  iV  is  the  number  of  terminala)  iterations  of  the  FOGA, 
each  of  which  prevents  a  single  pair  of  node  clusters  from 
merging.  At  the  end  of  each  iteration  of  the  Inhibit  loop,  the 
best  of  the  generated  sohitiona  (and  its  associated  cluster 
inhibitions)  is  kept  and  used  in  place  of  the  original  (unhibited) 
FOGA  solution,  and  the  Inhibit  loop  is  repeated  until  no 
further  progress  is  made. 

2)  Join- For  each  node  a/,  find  its  nearaat  neighbor  and 
nearest  neighbor  among  those  nodes  wliich  are  closer  to  the 
center  than  Cf.  Successively,  one  arc  at  a  time,  force  arcs  (cj, 
hj)  and  («/,  c/)  into  the  solution  if  they  are  not  already  present. 
One  Join  loop  consists  of  a  sequence  of  executions  of  the 
FOGA,  each  with  a  single  forced  arc.  The  Join  loop  is  iterated, 
starting  from  the  best  solution  obtained  during  the  previous 
loop,  until  no  further  progress  is  made.  The  number  of  FOGA 
iteiations  in  a  Join  loop  is  bounded  from  above  by  IN  —  1 
but  is,  on  the  average,  somewhat  leaa  than  Af,  the  exact  number 
being  a  function  of  the  particular  problem. 

Thus,  both  the  Inhibit  and  Join  prooeduies  have  running 
times  of  order  CN  tiroes  the  ruiuiing  time  of  tiie  embedded 
FOGA,  the  Join  procedure  being  somewhat  faster.  The  factor 
C  is  the  number  of  iterations  required  for  convergence,  Le., 
to  reach  the  point  of  no  further  progress.  It  would  be  possible 
to  greatly  improve  the  running  time  of  Che  SOGA  if  one  could 
restrict  one’s  attention  to  a  small  (ie.,  <  JV)  subset  of  forced 
uiclusioni  or  exclusions.  Also,  Kamauih  mentioned  that  the 
Join  procedure,  while  somewhat  faster  than  Inhibit,  is  weaker 
because  it  restricts  ittelf  to  local  transformatioru  involving 
nearest  neighbors. 

The  procedure  described  in  [7]  is  a  branch-and-bound 
algorithm  for  obtaining  optimal  solutions  to  CMST  problems. 
While  it  does  not  converge  to  an  optimal  solution  quickly 
enough  to  be  useful  as  a  design  procedure,  it  was  sufficiently 
effective  to  be  useful  in  obtaining  optimal  solutions  to  a  num¬ 
ber  of  problems  of  sufficient  size  and  constraint  tightness  to 
be  able  to  make  observations  about  characteristics  of  optimal 
solutions.  This  led  to  tlie  characterization  of  arc  subsets  whicli 
are  small  and  at  tlie  same  time  reasonably  effective  for  use  as 
candidates  in  a  SOG.a.  In  particular,  it  was  found  tliat  arcs 
present  in  optimal  solutions  but  not  present  in  heuristically 
generated  solutions  were  almost  always  MST  arcs.  While  it  is 
not  necessarily  to  that  all  ares  present  in  the  optimal  solution 
and  not  in  the  hsau-Wii'Lims  solution  are  MST  arcs,  in  all  ine 
experiments  run  and  ciiLcked  lo  this  eiui,  invariably  at  least 
one  arc  in  the  optimum,  but  not  in  Uie  Esau-Williams  solution, 
was  an  MST  arc. 

This  is  not  to  say  tliat  it  is  conjectured  that  one  must  al¬ 
ways  be  able  to  find  an  optimum  solution  wliicli  contains  an 


MST  arc  not  pait  o!  !i;t'  i  ..iii  U'uiiairis  solution  (when  the 
Esau-Williams  .solutio  osilf  is  not  op'ini.iiV  i;  -rirrely 

sij  i  ns  tin'  ,  i.'.l  .ii.c  in  lea-.i  .nahle  i  .o  ex- 

ainiiio  when '.ci'Kiiij’  t'  .r ...  mi  ition. 

Even  w.'l  lit  (h.  ■  ■  '  i:  .s  .lui  ;■  iy  ap- 

to  .!fi  !■  !  j  .  ■'  t  mna]  hn'-rtici;  .i);are  the 
jiro.'it;.  '-■1,';  1'  .  .  .  L  .odes  to  ihr.ii 

'P'  ■  i ■ ; ilir-  ,  ijff  ■  n 

tioi;;  Uiifci  s  in i  ,  .  ..  .  (  asi- 

’'-i-iy  't'  '  '.r.i  -  ■  ’  ,  '  ■[>  cl.e  a  cen- 

Sidtied  dunnp  tlie  e  ;  ■  ■-  nf  :  n  if  s  yeuris- 

I,,  i,.].,  ,,s  f  ,,;1  ;  .ntJirn 

'  ’  ■  O'  .  r  -  j  cnfiCai 

node  (or  noe'est  an  oseo  'n  i.-ip  able  to  inc  de  some 

desuahlr  arc  i  oi  ate  uLion  >t  is  quite  likely  that  one 

or  more  of  these  a;  'SI  arcs,  indeed,  as  mentioned 

above,  Uiis  was  alv  ■  with  the  experiments  carried 

out. 

Thus,  a  heunst  d  v'L'.,-h  attempts  to  improve 

solutions  eeneratey  Tra!  .uimstir  by  forring  the 

inclusion  of  one  oi  :i  :.  ■  i:  .h  the  primal  heuristic 

left  out.  This  gives  iiowmi;  procedure. 

Step  I  :  General  tres  =  =  l.-'.N-l}. 

Generate  a  feasible  ~  {b/\i  =  1,  •  N  —  l}. 

Find  S  =  l-'oi  problems  of  interest,  S  is 

nonempty.) 

Sup  2:  For  ead  '  .S,  set  .Sj  =  5  —  Sj.  Remove 

all  Oj  €  5j  from  tli  tart  a  solution  by  including  all 

ay  €5)  Apply  the  .tif;  to  ompirte  the  generation 

of  a  solution. 

It  sho'uld  be  nott  'teniative  proc.edure  would  be 

to  include  Oy  in  ,  to  ei.diidr  a,  in  5^.  It  was  felt, 

however,  that  if  elei  •  were  retsined  as  candidates  for 

inclusion,  duplicate  •  would  be  likely  to  result.  The 

exclusion  of  elemen-'  uarantees  unique  solutions,  and 

hence  should  increaf'  celihood  of  generating  improve¬ 

ments. 

COMPUT  NAL  EXPERIENCE 

In  order  to  assess  ti  ffectivt  .;e-.s  and  efficiency  of  Uie 
procedural  srherne  d.''-''.r(i,i''d  in  d.-’  r , ..'■.•i'.iis  .tretion,  a  series 
of  experiments  was  nin.  Nodr.t  'vere  generated  with  random 
locations  ove;-  -  unit  -  ..at  .  problem,;  with 

N  =  40,  60,  Ph  .  i.'v’  ■  o  y  ni  c.^ncTraii,  , 

on  the  nunibei  ■'''  nodc<;  m  ■  .  x-t  ■.  cn  ■  -  ,V/20 

and  m  =  ;V/4  w-ic  (  n-  ' 

Alternatively,  a  i  ■  i.n'  "-  . mild  h-  ^e  been  used, 

but  we  did  not  Vei  ''  '  '  .f.,  ,1- 

have  been  obtained  ' 'i”-  ‘  '  -  » 

running  time  f  .-iiu 

Algontlim  witlnr.  the  .  i  i  •'  i  v  -e  i  .-Iponthm  !8l 

karnaiigh  (61  o  o  the 

Ksau-’-Villiatns  pi-rce-'  n-  ,  •  •  .  ,•  I-,  i.nnii.g  ujnr  on 

an  HIM  570/15S.  'i  i.  ^  '  ■■  i'-'-  •  o  ■,  b  ' 

KL  20/50  which  is  scmi  ha'.  '  ■  ■  :  i'  .  ■!  a  '  u; 

order  to  establish  a  tn-  •  l  ■;  ■  n  l  ,nc  -  omparLsons  wi-l 

Kaniaugh's  procrd-ir  'i  ■  -  •  e  ’  .ind  Table  |.  we  n 

compared  out  imi'!  •  •  ■  i-  t  18)  .  '  ii._, 

'.\e  found  tiiat  ho-  ;  .  and 

computational  coie  >ii'xi,;i  -  >  nnadratic  and  ibic) 

for  Uic  lirst  FOf.A  iicr-ii.ii'  i:  e  im  .ilH.uning  an  initial  solu¬ 
tion),  but  that  ou:  nil: 'l  l  I  I  II  -  IS  of  a  lower  order  of 
conti>lex!ly  for  latei  -ii  i  .  u'jerce  in  ninning  limes 
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for  later  iterations  was  roughly  a  factor  of  S/2  for  .V  =  1 20. 
Since  most  of  the  execution  time  of  the  overall  procedure  is  in 
iteration  of  the  FOGA,  we  conclude  tliat  this  coincides  with 
the  differences  in  running  times  between  the  two  versions  of 
Inhibit  in  Fig.  2. 

Next,  by  running  a  sequence  of  120  problems  for  A'  be¬ 
tween  10  and  100,  we  examined  tJie  size  of  the  subsets  .S  gen¬ 
erated,  i.e.,  the  average  number  of  MST  links  not  present  in 
the  FCXjA  solution.  Most  primal  heuristics  generate  subtrees 
which  are  MST’s  on  the  set  of  nodes  they  contain  along  with 
the  center.  Most  of  the  arcs  in  these  subtrees  (with  the  excep¬ 
tion  of  the  arc  directly  connected  to  the  center)  will  be  MST 
arcs.  Thus,  one  might  expect  that  tlie  cardinality  of  5,  |5 1  ^ 
Af/m,  and  a  vast  improvement  can  be  made  over  a  blind  branch 
exchange  or  SOGA  procedure.  In  practice,  we  found  that  i  S  \ 
ranged  between  N/m  and  ^  log  Af/m . 

There  are,  in  fact,  2‘'^'  subsets  of  S.  Thus,  for  moderately 
large  tightly  constrained  problems,  iS  could  grow  large  enough 
to  make  evaluating  all  subsets  impractical.  Furthermore,  it  is 
reasonable  to  assume  that  not  all  arcs  in  S  interact  with  one 
another  i.e.,  improvements  in  separate  parts  of  the  network 
can  be  found  and  justified  independently  of  one  another. 
Thus,  the  heuristic  was  modified  to  only  consider  subsets 
Si  C  S  such  that  |5|  |  <  A  for  some  given  K.  The  best  subset 
5|*  is  found  and  permanently  forced  into  the  solution.  We 
then  set  5  =  5  —  5 1  *  and  repeat  the  procedure  until  no  fiuther 
improvement  can  be  made.  In  practice,  K  =  2  worked  well. 
Experiments  were  run  with  larger  values  of  A;  only  in 
isolated  cases  was  any  improvement  over  K  =  2  obtained. 
(Even  A  =  1  worked  well  in  many  cases.)  Thus,  it  was  decided 
in  all  the  remaining  experiments  to  use  tl»  extended  proce¬ 
dure  and  restrict  the  examination  to  subsets  of  cardinaiity  less 
than  or  equal  to  two. 

Using  this  modifleation  of  the  new  heuristic,  it  was  observed 
that  forcing  arcs  between  nodes  which  are  close  to  the  center 
seemed  to  have  the  greatest  effect  on  tire  value  of  the  solu¬ 
tion.  This  was  probably  a  consequence  of  the  fact  that  the 
Esau-Williams  algorithm  starts  with  nodes  from  the  center, 
and  hence,  dealing  with  nodes  near  the  center  first  radically 
changes  the  solution  value. 

In  particular,  it  was  found  that  for  small  networks  (A/<  20), 
there  was  little  difference  (usually  <  1  percent)  in  perform¬ 
ance  between  the  new  heuristic  and  the  Esau-Williams  solu¬ 
tion.  This  is  almost  certainly  because  both  procedures  were 
generating  near-optimal  solutions.  However,  for  larger  net¬ 
works,  particularly  for  tightly  constrained  problems,  the  new 
heuristic  performed  noticeably  better  with  improvements  aver¬ 
aging  about  1 .5  percent.  Fig.  1  shows  the  improvement  relative 
to  the  Esau-Williams  algorithm.  As  can  be  seen,  the  improve¬ 
ment  increases  with  the  problem  size.  Results  were  even  more 
encouraging  for  large  networks  with  the  central  site  in  the 
comer.  Such  networks  may  be  viewed  as  one-fourth  of  a  net¬ 
work  of  AN  nodes  with  the  central  site  in  the  center.  For  such 
problems,  improvements  averaging  4  percent  and  as  high  as 
8  percent  were  observed.  Thus,  tlie  procedure  appears  to  be  of 
value  for  many  realistically  sized  problems  with  tight  con¬ 
straints. 

A  straightforward  implementation  of  the  Esau-Williams 
procedure  has  a  computational  complexity  of  order  log]  N. 
A  more  careful  implementation  (9|  can  reduce  the  complexity 
to  order  N  log]  N.  As  discussed  previously,  |  S  \  was  found  to 
range  between  A//m  and  N  log  N/m.  Since  we  examine  subsets 
of  cardinality  at  most  2,  subsets  are  examined  on  each 
iteration.  At  worst,  |5|/2  successive  subsets  of  cardinality  2 


Fig.  1.  Improvements  obtained  with  new  heuristic. 


Fig.  2.  Comparison  of  run  times. 

could  be  introduced,  and  thus,  the  procedure  could  iterate  at 
most  1 5 1/2  times.  In  practice,  the  number  of  iterations  grows 
more  slowly  than  |S  |. 

Thus,  a  careful  implementation  of  the  procedure  has  a 
complexity  of  |  S  I^Af  log  N. 

Finally,  a  version  of  the  Inhibit  and  Join  procedures  (as 
described  in  [6] ,  but  using  the  Unified  Algorithm  in  place  of 
the  alternate  implementation  of  Esau-Williams)  was  coded  and 
run  on  another  set  of  problems,  and  the  heuristic  described  in 
this  paper  was  then  run  on  the  same  set  of  problems.  Thus,  we 
were  able  to  directly  compare  the  quality  of  the  obtained  solu¬ 
tions,  as  well  as  the  running  times  of  the  new  heuristic  with 
Inhibit  and  Join.  The  running  times  are  summarized  in  Fig.  2 
and  Tables  I  and  11.  As  can  be  seen  tlie  new  heuristic  is  much 
faster  than  Inhibit  and  faster  than  Join.  Indeed,  it  is  only  two- 
three  times  slower  than  the  Unified  Algorithm. 

'file  quality  of  Uie  solutions  obtained  varied.  Overall,  the 
results  were  consistent  with  the  experiment  described  above, 
although  a  drop  in  effectiveness  is  noted  for  tlie  120-node 
problems.  We  do  not  consider  this  latter  phenomenon  signifi¬ 
cant  since  there  was  a  large  variation  in  individual  runs,  and 
the  first  experiment,  which  was  based  on  a  much  larger  num- 
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TABl  K  I 

ACOMI'AHISON  Oh  Run  TlMt  (IN  SK.CONOS) 


.Number  ol  .''.odes 


Algorithm 

40 

60 

90 

1 20 

Inhibit  (KarnAugh's 
Implementation) 

9.  0 

26.  7 

1  18 

2  40 

Inhibit 

1.  8 

9.  5 

46 

113 

Join 

1.  4 

4.  3 

12 

24 

New  Heuristic 

1.2 

3.  6 

s' 

13 

Es*u-Wilhami  (unified) 

0.  5 

1.  1 

2.  6 

4.  7 

TABLE  II 

PERCENT  IMPROVEMENTS  IN  PERFORMANCE  OVER 
ESAU-WILLIAMS 

Number  of  Nodet 


Algorithm 

40 

60 

40 

120 

.Aver^ee 

Inbibit 

1.  1 

2.0 

3.  6 

2.  6 

2.  t> 

New  Heurtgtic 

1.  4 

1.  S 

2.  6 

0.  g 

1.  * 

Jou) 

2.  1 

1.  s 

2.0 

0.  4 

1. 6 

(ICMtin  III  ceniieijred  coiiipuirr  neiwuiks.'  ArfH'orhi.  vul  I  no  I. 
pp  41  .S7.  1071 

jB)  M  Keriiju^'h.  A  nri*  ol  jl|'oitihiiis  loi  niuliip.iini  nci'*ork 

(ipliini/uhoii.  It.tt  /f.j.i)  (  ,,mn,un  lol  (OM  74.  pp  ItKL 

5().S.  May  107b 

i?)  A  Ker .hi'nhjuin  aiij  k  ll■>ll(^un.  (  rnirali/ej  ieicpnH.xsMn(L: 
nelwork  drsiyn.  in  /’f...  IttJ  Irlricmmun  t  tint  ,  l>ec  IV.'S 
pp  27  I  1-27  14 

|8|  A  Kershcnbauni  and  kk  c  (lou  A  unilicd  al^onihm  lor  designing 
mullidrop  IcIeprrK'csMnp  MciisorkN.'  It.LL  Trans  Cummun  ,  lol 
COM-22,  pp  1762-177’.  Nov  1974 

|V1  A-  Ker&henhauni,  "Conipuiing  tapaciiatcd  nonitnal  spanning  uccs 
cfficicnlly.  "  Seinorks  vol  4.  no  4.  1974 
1 10)  N  V  Rcinfcid  and  \k  R  Vogel.  Mathtmatnal  f^rogramm.ng 
Englewood  Cliffs,  NJ  Pieniii.c  Hall.  1958 


'oer  ot  problems,  exhibited  no  similar  behavior.  In  some  cases, 
the  new  heuristic  outperformed  Inhibit;  in  others,  Inhibit 
performed  birtter.  On  the  whole,  the  quality  of  the  solutions 
obtained  wit^'  Inhibit  was  2.6  percent  better  than  those  ob> 
l.iineo  using 'the  Fsau-Williams  procedure,  while  those  obtained 
using  the  new  heuristic  were  1.9  percent  better  than  Esau- 
W’iiiiams.  Botii  inhibit  and  tlie  new  heuristic  outperformed 
Join  fairly  consistently  ;  Join  averaged  a  1.6  percent  improve- 
nicnt  over  Esau~Williams. 


CONCLUSION 

We  found  the  new  heuristic  to  be  of  deTinite  value  as  it 
ootained  solutions  rouglily  2  percent  better  than  the  Esau- 
Wuiiams  procedure  without  greatly  increasing  the  running 
time.  In  comparison  with  Kamougli's  SOCJA's,  the  new  heuris¬ 
tic  IS  much  faster  and  obtains  solutions  somewhat  better  tlian 
Join  and  somewhat  worse  tlian  Inliibit.  We  conclude  that  tlie 
new  heuristic  is  useful  as  a  practical  design  procedure,  even 
when  imbedded  in  a  larger  procedure  which  solves  more 
global  problems. 
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Abstract 

The  problem  considered  is  that  of  finding  an  optimal  (minimum 
cost)  design  for  a  centralized  processing  network  given  a  set  of 
locations,  traffic  magnitudes  between  these  locations,  and  a  single 
common  source  or  destination.  Several  heuristics,  which  are  efficient 
(in  terms  of  their  execution  time  and  memory  requirements  on  a 
digital  computer)  and  which  produce  seemingly  good  results,  have 
already  been  developed  and  are  currently  accepted  techniques.  Some 
work  has  also  been  done  on  finding  optimal  solutions  to  this  problem 
both  as  a  design  tool  and  as  a  means  of  verifying  the  effectiveness  of 
proposed  heuristics.  We  focus  in  this  latter  area.  Currently  known 
techniques  for  the  optimal  solution  of  this  problem  via  integer  pro¬ 
gramming  have  fallen  short  of  the  desired  objectives  as  they  require 
too  much  memory  and  running  time  to  be  able  to  treat  problems  of 
realistic  size  and  complexity.  We  develop  an  improved  technique 
which  is  capable  of  handing  more  realistic  problems. 
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1.  INTRODUCTION  AND  PROBLEM  STATEMENT 

The  problem  considered  is  that  of  finding  an  optimal  (minimum 
cost)  design  for  a  centralized  telecommunication  network  given  a  set 
of  locations,  traffic  magnitudes  between  these  locations,  and  a  single 
common  source  or  destination.  The  vast  majority  of  telecommunication 
networks  currently  in  existence  are  of  this  type.  Thus,  this  problem 
has  been  much  studied  (2,3,4,5,8,9,12,16,24,28,31,32). 

Several  heuristics,  which  are  efficient  (in  terms  of  their  execu¬ 
tion  time  and  memory  requirements  on  a  digital  computer)  and  which 
produce  seemingly  good  results,  have  already  been  developed  and  are 
currently  accepted  techniques.  Some  work  has  also  been  done  on 
finding  optimal  solutions  to  this  problem  as  a  means  of  verifying  the 
effectiveness  of  proposed  heuristics.  Currently  know  techniques  for 
the  optimal  solution  to  this  problem  via  integer  programming  have 
fallen  short  of  the  desired  objective  as  they  require  too  much  memory 
and  running  time  to  be  able  to  treat  problems  of  realistic  size  and 
complexity.  We  develop  an  improved  technique  which  is  capable  of 
handling  problems  of  realistic  size. 

More  formally,  the  problem  considered  here  is  that  of  finding  a 
minimum  spanning  tree  subject  to  one  or  more  constraints  which  in 
general  are  equivalent  to  demanding  that  the  sum  of  the  traffic  asso¬ 
ciated  with  the  nodes  in  any  subtree  must  not  exceed  some  predeter¬ 
mined  maximum. 

A  minimum  spanning  tree  is  a  loop-free  collection  of  arcs  joining 
a  set  of  nodes  such  that  the  sum  of  the  lengths  of  the  arcs  is  mini¬ 
mal.  In  the  case  of  a  communication  network,  these  collections  of 
arcs  are  called  multidrop  lines. 
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It  should  be  noted  that  this  constraint  form  is  quite  general  .ind 
encompasses  many  real-world  constraints  which  .uise  in  the  design  ot 
centralized  telecommunications  networks.  Thus,  for  example,  in 
addition  to  treating  the  obvious  constraint  imposed  by  line  capacity, 
it  is  possible  to  treat  a  restriction  on  the  number  of  terminals  on  a 
multidrop  line  by  associating  a  uniform  traffic  with  each  terminal. 
Also,  the  length  (cost)  functions  which  can  be  treated  are  quite 
general.  Any  function  which  is  not  a  function  of  the  tree  chosen  is 
permissable. 

Formally,  we  seek  to  solve  the  following  problem: 

Given 

1.  A  vertex  (node)  set  V  =  jv.li=0,l , . . .  ,n}  representing  the 

terminal  locations  in  the  network.  Node  v^  is  a  distin¬ 

guished  node  which  we  will  refer  to  as  the  center. 

2.  A  symmetric  function  giving  the  length  (cost)  djj  of  an  arc 
between  any  pair  of  locations. 

3.  A  constraint,  m,  on  the  number  of  nodes  which  may  share  a 

multidrop  line.  This  constraint  can  be  generalized  to  allow 

a  weight  or  traffic,  Cj,  to  be  associated  with  each  node  and 

to  require  that  the  sum  of  the  weights  associated  with  the 

nodes  on  any  multidrop  line  not  exceed  m. 

t  h 

We  define  the  set  of  nodes  in  the  multidrop  line  to  be  and 

the  multidrop  line  itself  to  be  a  minimal  spanning  tree  7’^,  on  V. 

i 

Thus,  the  constraint  can  be  staled  in  terms  of  the  cardinality  of  Vj 

as  |V.|  <  m  V..  In  the  more  general  form,  the  constraint  would  be 

J  1  * 

1  c.  <  m  V..  We  wish  to  find  a  tree,  T^  of  minimum  total 
Vj  c  Vj  '  '  I 

length  satisfying  the  constraint  in  3  above,  'lhat  is,  we  wish  to 


ot  Vj,  i.e.,  the  node  closest  to  v.  on  tl'-  poth  between  v.  .jiui  in 
T,  and  T  is  any  spanning  tree.  We  consi  er  exo  t  (optimal)  scjlutions 
to  this  problem.  The  primary  motivation  tor  the  work  is  to  devi  iuf) 
an  exact  algorithm  capable  ol  [lermitting  study  ol  the  (xatormance  .,! 


heuristics  on  a  broader  class  ot  problems  than  was  previcjusly  studied, 
to  gain  insight  into  the  performance  of  both  exact  and  heuristic 
procedures  and,  in  particul.ii  ,  to  pinpoint  where  and  why  they  toil. 


II.  OUTLINE  OF  A  NEW  OPTIMAL  SOLUTION  TECHNIQUE 

There  currenty  exist  several  techniques  which  will  yield  optimal 
solutions  to  the  CMST  problem.  These  techniques  can  be  divided  into 
two  classes  -  branch  exchange  methods  (as  proposed  by  Lin  (22)  and 
Frank  (9))  and  branch  and  bound  methods  (10,22).  We  concentrate 
on  the  latter  class  of  techniques. 

The  specific  application  of  branch  and  bound  techniques  to  the 
solution  of  the  CMST  problem  was  proposed  by  Chandy  and  Russell 
(3)  and  was  subsequently  refined  (2)  so  that  it  could  treat  somewhat 
more  meaningful  problems.  Subsequently,  Elias  and  Ferguson  (4) 
proposed  further  refinements  and  thereby  expanded  the  range  of 
applicability  of  the  technique.  Gavish  (34)  recently  developed  a 
bound  using  Lagrangean  relaxation. 

The  basic  technique  is,  as  has  already  been  mentioned,  a  branch 
and  bound  algorithm.  The  original  problem  considered  has  all  branches 
in  the  category  "permissible,"  i.e.,  any  branch  may  or  may  not  be 
part  of  the  final  solution.  Subproblems  are  generated  by  selecting  a 
permissible  branch  and  making  it  "prohibited"  in  one  subproblem  or 
"required"  in  another. 

The  relaxation  used  is  simply  to  generate  a  modified  MST  by 
including  all  "required"  branches,  excluding  all  "prohibited"  branches, 
and  forming  the  tree  of  minimum  total  length  by  connecting  (as  yet 
unconnected)  nodes  using  remaining  ("permissible")  branches. 

Clearly,  a  solution  obtained  in  this  manner  is  a  lower  bound  on 
the  value  of  a  feasible  solution  to  the  subproblem  as  it  is  the  tree  of 
minimum  length.  Note  also  that  in  the  case  where  all  arcs  are  speci¬ 
fied  (prohibited  or  required),  the  lower  bound  and  solution  are 
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identical  and  the  subproblem  fathoms.  In  general,  the  subproblem 
fathoms  when 

1.  No  feasible  solution  exists  to  the  subproblem.  This  occurs, 
when  the  required  branches  form  a  loop,  when  the  required 
branches  create  a  subtree  violating  the  constraints,  or 
when  the  prohibited  branches  disconnect  the  network. 
Other  criteria  exist  but  are  difficult  to  test  for. 

2.  The  lower  bound  equals  or  exceeds  the  value  of  the  best 
solution  found  thus  far. 

3.  The  lower  bound  solution  is  feasible. 

When  all  subproblems  have  fathomed,  the  current  best  solution  is 
the  global  optimum. 

A  number  of  observations  have  been  made,  which  can  be  used  to 
accelerate  a  basic  branch  and  bound  technique.  One  of  these,  which 
is  used  in  the  sequel,  is  given  in  Theorem  1  below; 

Theorem  1:  (3) 

If  branches  (v^,v.  ),  (v^,v.  ) - -  (Vf.,v.  ),  are  part  of 

^  ^  >2 

some  MST,  T,  on  V  then  there  exists  a  CMST  including 
these  edges. 


Corollary; 


If  arcs  (vQ,Vj  ),  (v^.v-  (Vq.v^  )  are  present  in  the 

2  K 

modified  MST  produced  in  any  subproblem,  then  (if  any 


CMST's  exist  in  the  subproblem)  ther^  exists  a  CMST  on 


the  subproblem  containing  these  arcs. 

The  preceding  theorem  and  corollary  allow  one  to  avoid  consider¬ 
ing  subproblems  with  such  arcs  prohibited.  The  techniques  devel¬ 
oped  in  the  sequel  make  explicit  use  of  both  observations  as  well  as 
others  made  in  the  references  cited. 
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The  inherent  problem  with  the  existing  procedures  lies  in  the 
relaxation  method  used.  At  each  step,  the  problem  is  relaxed  to  a 
modified  MST.  Unfortunately,  this  bound  is  often  too  loose  to  elimi¬ 
nate  a  sufficiently  large  percentage  of  the  subproblems  to  make  the 
procedure  practical.  This  is  particularly  evident  when  the  constraints 
are  tight;  it  is  for  such  problems  that  the  relaxation  is  loosest. 
Unfortunately,  it  is  also  for  that  class  of  subproblems  that  the  known 
heuristics  display  the  widest  variation  in  the  quality  of  solutions. 

Note  that  any  optimal  solution  to  the  CMST  problem  has  the 
property  that  all  subtrees  are  MST's  on  the  set  of  nodes  contained  in 
the  subtree  and  the  center.  Thus,  it  suffices  to  find  the  optimal 
partition  of  the  nodes  into  subtrees.  The  technique  which  is  devel¬ 
oped  in  the  following  sections  will  thus  generate  partitions  of  the 
nodes . 

The  technique  works  within  the  framework  of  branch  and  bound 
algorithms,  as  did  the  techniques  referred  to  above.  We  develop  two 
algorithms ,  one  based  on  generating  subproblems  by  restricting 
nodes,  and  the  other  based  on  generating  subproblems  by  restricting 
arcs.  These  techniques  differ  from  previous  ones  in  that  the  relaxa¬ 
tion  used  here  is  tighter  and  thus,  a  smaller  number  of  subproblems 
need  be  examined. 

We  begin  by  restricting  the  problem  slightly.  We  seek  a  CMST 
subject  to  the  constraint  that  the  number  of  nodes  (rather  than  the 
sum  of  the  weights  of  the  nodes)  in  any  subtree  not  exceed  a  pre¬ 
specified  maximum.  Since  our  primary  intent  here  is  to  study  the 
performance  of  CMST  algorithms,  this  modification  would  not,  in 
general,  have  a  significant  effect.  Indeed,  if  one  preferred,  a  node 


of  weight  K  could  be  replaced  by  K  nodes  of  weight  1,  providing  one 
is  willing  to  allow  the  original  node  of  weight  K  to  be  split  among 
more  than  one  subtree. 

To  find  a  partition  of  the  nodes  V  =  {Vj|  i  =  l,2,...n}  into 
subtrees,  we  begin  by  making  n  copies  of  each  node  corresponding  to 
the  possibilities  of  the  node  being  in  any  of  n  possible  subtrees. 
Thus,  v.|  corresponds  to  node  Vj  being  in  subtree  j. 

The  problem  of  obtaining  an  optimal  partition  of  nodes  into 
subtrees  can  be  thought  of  as  one  of  selecting  an  optimal  subset  from, 
the  set  E  =  {v^l  i=l,2,...n;  j=l,2,...n}.  Feasible  subsets  of  E, 
i.e.,  those  corresponding  to  partitions  satisfying  the  capacity  con¬ 
straint,  will  contain  1  v..  for  each  i  and  at  most  m  v./s  for  each  j. 
If  we  associate  a  weight,  Wjj,  with  each  v^j,  the  optimal  subset  of  E 
(hence  the  optimal  partition  of  V)  is  defined  as  the  feasible  subset  of 
minimum  total  weight. 

An  efficient  algorithm  (see  14,  21,  35)  exists  for  the  solution  of 
the  problem  of  finding  the  optimal  subset  of  E  given  the  values  of 
Wjj.  The  algorithm,  which  can  be  thought  of  as  a  matroid  interesec- 
tion  [1,17,19,20,29]  algorithm  or  alternatively  as  a  series  of  shortest 
path  problems  in  appropriately  defined  graphs,  has  a  worst  case 
running  time  of  order  n^  and  in  practice  has  a  running  time  closer  to 
order  n^  (see  35).  Unfortunately,  the  set  of  weights,  w^j,  which 
correspond  directly  to  the  "cost"  (contribution  to  the  overall  length 
of  the  CMST)  of  Vj  in  subtree  j  can  be  specified  only  when  the  pro¬ 
blem  solution  is  already  known.  We  can,  however,  define  a  set  of 
weights,  w..,  which  have  the  property  that  the  optimal  partition  found 
using  these  weights  will  have  a  value  (sum  of  weights)  which  is  a 


lower  bound  on  the  length  of  the  optimal  CMST.  Thus,  we  can  relax 
the  CMST  problem  to  the  problem  of  finding  an  optimal  partition. 
This,  together  with  generating  subproblems  by  successively  restrict¬ 
ing  either  nodes  or  arcs,  gives  rise  to  an  optimum  CMST  algorithm 
within  the  branch  and  bound  context. 

An  appropriate  set  of  weights,  w^^,  can  be  defined  as  follows. 
Suppose  we  are  given,  for  each  subtree  j,  the  set  Vj  of  nodes  per¬ 
mitted  in  the  subtree;  a  procedure  for  obtaining  the  V.  will  be  given 
below.  One  can  then  find  T^,  the  minimum  spanning  tree  on  the 
nodes  in  ^  ^ij  length  of  the  arc  connecting  v^ 

to  its  predecessor  in  Tj  (i.e.,  d^j  is  the  length  of  the  last  arc  in  the 
path  from  Vq  to  Vj  in  T|).  The  following  theorem,  which  is  proven  in 
(35),  allows  us  to  obtain  appropriate  Wj.: 

Theorem  2:  The  weight  of  the  optimal  partition  using  w^^  =  dj^  is  a 
lower  bound  on  the  length  of  the  CMST  for  the  same  V  and  M. 

Furthermore,  it  is  proven  in  (35)  that  other  similarly  defined 
w.|'s  also  preserve  this  lower  bound.  In  particular,  suppose  T^ 
contains  a  path  (vQ,...Vp,  v^, . .  .v^, . . .  v^, . . . )  as  shown  in  Figure 
1.  Let  S  be  the  set  of  nodes  {v  ,...v,  ,...v  }  and  let  w,  be  the 

C|  K  S  K 

largest  weight  of  any  node  in  S.  Suppose  Wp^  >  Wj^. .  Define  A  = 
Wpj  -  Wj^j.  Then  the  following  theorem  holds: 

Theorem  3:  If  a  set  of  weights  w^j  =  dj^  is  modified  by  transferring 
weight  A  from  Wp.  to  w^j  where  A,  Vp  and  v^  are  defined  as  above, 
the  weight  of  the  optimal  partition  is  still  a  lower  bound  on  *^he  length 
of  the  CMST  for  the  same  V  and  m. 

Theorem  3  allows  us  to  transfer  weight  from  a  node  to  its  suc¬ 
cessors  in  Tj  in  order  to  guarantee  that  the  lower  bound  obtained 
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from  the  partitioning  problem  is  at  least  as  tight  as  the  bound  ob¬ 
tained  using  an  MST,  as  is  done  in  (2),  (3),  and  (4).  A  proof  that 
this  can  always  be  done  is  given  in  (35).  As  an  example  of  how  this 
works,  consider  the  network  shown  in  Figure  2a.  The  MST  for  this 
network  and  the  node  weights  corresponding  to  it  are  shown  in  Figure 
2b.  These  weights  correspond  to  the  bound  obtained  using. an  MST. 
Suppose,  however,  that  we  restrict  V3  from  being  part  of  a  given 
subtree  j.  The  weights  shown  in  Figure  2c  would  then  be  obtained  if 

we  simply  set  w..  =  d...  Note  in  particular  that  w».  has  been  reduced 
1]  1] 

from  5  to  1.  This  reduction  in  W2j  could  result  in  a  loosening  of  the 
lower  bound.  Theorem  3  allows  us  to  transfer  up  to  A  =  w^^  -  W2j, 

i.e.,  7  units  of  weight,  from  w^^^  to  W2j  and  obtain  the  weights  shown 

in  Figure  2d.  Note  that  the  Wj^  in  Figure  2d  are  at  least  as  great  as 
the  w..  in  Figure  2b.  Thus,  the  lower  bound  obtained  using  the  w^^ 
in  Figure  2d  will  be  at  least  at  tight  as  that  obtained  using  an  MST. 

In  fact,  the  bound  so  obtained  is  significantly  tighter,  as  is  shown 

by  the  computational  experience  given  in  Section  V. 

We  now  turn  to  the  question  of  the  branching  rule  within  the 
branch  and  bound  procedure.  Little  was  said  by  Chandy  and  Russell, 
Chandy  and  Lo,  and  Elias  and  Ferguson  on  the  order  in  which  sub¬ 
problems  are  considered  in  the  branch  and  bound  procedure.  Classi- 
cially,  two  approaches  are  available.  The  first  is  to  always  consider 
the  subproblem  with  the  least  lower  bound.  Alternatively,  one  can 
use  depth  first  search,  where  one  always  solves  most  recently  gener¬ 
ated  subproblems  before  returning  to  older  subproblems.  There  are 
advantages  and  disadvantages  to  both  approaches. 


The  first  approach  allows  one  to  proceed  without  any  good 
feasible  solutions  to  guide  the  process.  The  assumption  is  that 
subproblems  with  the  lowest  lower  bounds  will  give  rise  to  the  best 
feasible  solutions.  Hence,  one  prefers  to  explore  these  subproblems 
first  in  the  hope  that  they  will  give  rise  to  low  cost  feasible  solutions 
which  will  eliminate  other  subproblems  (with  higher  lower  bounds) 
from  consideration.  Also,  by  examining  subproblems  in  this  order, 
one  is  continually  narrowing  the  range  between  the  upper  and  lower 
bounds,  and  hence,  has  the  option  of  terminating  the  algorithm  when 
the  interval  shrinks  to  some  prespecified  width. 

There  are,  however,  two  major  drawbacks  to  this  approach. 
First,  one  must  keep  (a  potentially  large  number  of)  subproblems 
around  in  order  to  select  the  next  one.  Thus,  the  storage  required 
for  the  procedure  is  potentially  exponential.  In  practice,  it  was 
storage,  not  running  time,  which  was  the  active  constraint  on  problem 
size  in  previously  developed  techniques.  One  could  temporarily  store 
subproblems  in  secondary  storage,  but  this  would  complicate  and  slow 
down  the  procedure. 

Second,  by  considering  problems  in  ascending  order  of  lower 
bound,  one  will,  in  general,  be  sequentially  considering  dissimilar 
subproblems.  Thus,  one  cannot  easily  take  advantage  of  information 
obtained  in  the  solution  of  one  problem  for  the  solution  of  another. 
For  example,  in  the  Elias  and  Ferguson  technique,  the  similarity 
between  modified  MST's  for  related  subproblems  cannot  be  easily 
exploited  if  this  first  procedural  outline  is  adopted. 

Using  depth  first  search  overcomes  both  of  these  objections. 
Indeed,  a  great  deal  of  simplification  is  obtainable  both  in  the  genera- 
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tion  of  subproblems  and  in  obtaining  solutions  owing  to  the  similarity 
of  successively  considered  subproblems. 

The  maximum  number  of  subproblems  which  need  be  kept  around 
at  any  time  is  bounded  by  the  number  of  nested  specifications  it  is 
possible  to  make.  Thus,  if  one  is  restricting  nodes,  the  bound  is  n; 
if  one  is  including  or  excluding  arcs,  the  bound  is  C^)-  This  essen¬ 
tially  eliminates  storage  as  an  active  constraint  on  the  size  of  the 
problem  which  can  be  considered. 

A  further  reason  for  using  depth  first  search  is  that  the  major 
reasons  one  would  ordinarily  choose  the  first  procedure  are  not 
present  here.  Any  of  the  existing  heuristics  can  be  used  to  quickly 
generate  a  good  upper  bound.  Furthermore,  the  procedures  devel¬ 
oped  in  the  sequel  lend  themselves  to  generating  feasible  solutions  for 
all  subproblems.  Thus,  a  good  upper  bound  is  always  available. 

Hence,  depth  first  search  is  used  in  developing  the  techniques 
in  the  sequel.  It  should  be  further  noted  that  the  philosophy  used 
in  developing  these  techniques  was  to  create  the  simplest,  most  flex¬ 
ible  framework  within  which  to  work  so  that  a  variety  of  acceleration 
techniques  could  be  developed  and  tested.  The  concentration  is  on 
restricting  the  number  of  subproblems  examined  (which  is  exponential 
in  n)  rather  than  the  amount  of  work  spent  on  each  subproblem 
(which  is  a  low  order  polynomial  in  n). 
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in.  NODE  PARTITIONING 


The  first  exact  technique  built  around  the  above  relaxation  is 
one  which  generates  subproblems  by  restricting  the  subtrees  a  node 
is  allowed  into.  The  procedure  is  described  below.  We  begin  by 
describing  the  initialization  procedure. 

Step  0:  (Initialize) 

* 

0.1)  Find  an  upper  bound,  z  ,  using  a  heuristic  to  generate 
a  good,  feasible  solution. 


0.2) 


0.4) 


0.5) 

0.6) 


0.7) 


0.8) 


Find  an  MST,  T,  and  identify  arcs  (v^,v.  ),  (v^,v.  ), 

(J  1^ 


0.3)  Reorder  the  nodes  so  that  v.  ,v.  ,...v.  are  now  v, 

*1  *2  ‘k 

r TRUE  i  ^  j 


^2 . ''k- 


For  1  <  n,  set  R.j  = 


I 


FALSE  i  =  j 


[Rjj  is  a  logical  variable  which  is  set  to  TRUE  if  v..  has 
been  removed  from  consideration  in  this  subproblem. 
Observations  made  above  allow  us  to  remove  some  v 
immendiately  ] . 


ij 


For  k  <  i  <  n,  set  R.j  = 


r  TRUE  i  >  j 
V.  FALSE  i  <  j 


For  i  1  j  <  n,  find  an  MST,  Tj,  on  Vj  v^,  where 

^  A 

Vj  =  {v.|R.j  =  FALSE}  i.e.,  Vj  is  the  set  of  nodes 
permitted  in  subtree  j. 

Set  w.j  =  d^j,  where  d-  is  the  distance  from  v.  to  its 
predecessor  in  Tj. 

For  j  =  1,2,...,  k,  exchange  weight  between  w..  for 


different  values  of  i  so  that  the  resulting  modified 


weights,  Wjj  ,  satisfy: 

'^ij  -  '^iO 

where  w.q  =  d.p  in  the  unconstrained  MST,  T,  gener¬ 
ated  in  Step  0.2  above. 

0.9)  For  j  =  k+1,...,  n,  exchange  weight  between  Wjj  so 
that  the  resulting  w.'.  satisfy 

Wj'j  >  «!.  -  1 


The  justification  for  all  of  these  steps  was  given  in  Section  2. 

Steps  0.8  and  0.9  guarantee  that  the  individual  w.'j  will  all  be  at 
least  as  great  as  the  weights  assigned  using  unconstrained  MST. 
Hence,  this  is  a  realization  of  the  statement  that  the  lower  bound 
obtained  using  this  procedure  must  be  at  least  as  great  as  the  lower 
bound  obtained  using  an  MST.  This  also  holds  true  for  subproblems. 
Thus ,  we  have  initialized  a  subproblem  with  nodes  1,2,...,  k  forced 
into  subtrees  1,2,...,  k,  respectively,  since,  for  i  <  k,  R.j  =  TRUE 
for  i  5^  j.  In  the  course  of  the  depth  first  search,  we  keep  track  of 
the  following  variables: 

d  =  The  depth  of  the  search ,  i .  e . ,  the  number  of  nodes 
which  have  been  forced,  d  is  initialized  to  k. 

^MIN  ~  minimum  allowable  depth,  is  initialized  to  k 

since  at  least  k  nodes  should  will  be  forced. 

IWd  =  The  subtree  which  the  d^^  node  is  forced  into. 

The  depth  first  search  proceeds  by  forcing  node  k+1  into  sub¬ 
tree  1;  i.e.,  d  is  set  to  k+1  and  is  set  to  1.  It  continues 

either  by  increasing  d  (to  force  another  node)  changing  IW .  (to  force 
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a  node  into  a  different  subtree)  or  decreasing  d  (to  release  a  node 
after  forcing  it  successively  through  all  subtrees).  The  depth  first 
search  procedure  follows. 

Depth  First  Search  Procedure 

Step  0:  Initialize  problem  (Steps  0.1  through  0.9  above). 

Set  d  =  k+1 

Set  =  k+1  Set  IW^  =  1 

Step  1:  Solve  the  currently  defined  subproblem;  i.e.,  find  a 
lower  bound  Zj^,  and  a  feasible  solution  Zp. 

Step  2:  If  the  current  subproblem  fathoms;  i.e.,  z^  >  Zp,  go 
to  Step  3;  otherwise  go  to  Step  5. 

Step  3;  Set  IW^  =  IW^  +  1 

If  IW^  >  NMAX^  go  to  Step  4;  otherwise  set  up  a  new 
subproblem  and  go  to  Step  1.  NMAX^j  is  the  highest 
indexed  subtree  which  the  node  at  depth  d  may  be 
forced  into.  In  section  3  we  observed  that  one  should 
not  skip  over  subtrees.  Thus: 

NMAXj  =  max  (K,  max  [IW.])  K  <  i  <  d 
Step  4:  Set  d  =  d+1 

If  d  <  dj^jj^j  stop;  otherwise  set  up  a  new  subproblem 
and  go  to  Step  1. 

Step  5:  Set  d  =  d+1 
Set  IW^  =  1 

Set  up  new  subproblem  and  go  to  Step  1. 

To  set  up  a  new  subproblem,  one  need  only  modify  the  values  of 
a  few  Rjj  to  impose  (or  remove)  the  restriction  implied  by  the  alter¬ 
nation  of  d  and  IW^.  Thus,  after  d  is  set  to  d+1  and  IW^  is  set  to  1: 


I 


i 


"i 


1# 


.a 


ri 
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The  search  space  can  be  further  pared  using  the  corollary  to 
Theorem  1.  If,  in  any  subproblem,  one  finds  that  two  nodes,  v.  and 
Vj,  both  forced  into  the  same  subtree  appear  in  separate  subtrees  in 
the  MST  formed'  on  the  set  of  permissable;  nodes  in  that  subtree ,  then 
the  subproblem  may  be  discarded. 

As  was  mentioned,  this  optimal  technique  based  on  generating  a 
partition  lends  itself  simply  to  obtaining  a  feasible  solution  to  each 
subproblem.  The  partition  generated  at  each  step  is  feasible.  One 
need  only  generate  MST's  on  each  group  of  nodes  to  obtain  a  feasible 
solution.  A  simple  acceleration  technique,  which  proved  to  be  quite 
effective  in  practice ,  was  to  reorder  the  nodes  ,  •  • . ,  v^  by 
distance  from  Vq,  nearest  first.  This  tended  to  increase  the  lower 
bound  most  rapidly.  Such  nodes,  when  restricted  to  a  single  sub¬ 
tree,  were  absent  from  all  others,  and  the  "deprived"  subtrees  were 
often  forced  to  connect  to  Vq  over  longer  arcs. 

The  R.j  are  used  in  the  optimal  partitioning  procedure  in  a 
straight  forward  fashion;  any  element  v.^,  with  its  corresponding  R.. 
=  TRUE,  is  considered  to  be  removed  from  the  problem. 
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The  weight  exchange  procedure  described  as  part  of  the  initiali¬ 
zation,  which  guarantees  that  the  weight  on  each  node  will  be  at  least 
as  great  as  its  contribution  to  the  length  of  an  MST,  is  used  here  as 
well.  At  each  level,  K,  in  the  decision  tree,  we  save  the  values  of 

ir 

the  w.’.  in  a  variable  referred  to  as  w^.  We  then  demand  that  Wj'.  = 
wfj  -  '  exchange  weights  to  enforce  the  restriction.  The 

justification  for  doing  so  is  identical  to  that  used  in  the  initialization 
procedure.  Note  that  this  exchange  guarantees  not  only  that  the 
lower  bound  will  remain  tighter  than  an  MST,  but  also  that  the  lower 
bound  will  be  monotone  with  the  depth  in  the  decision  tree.  Neither 
of  these  things  is  true  without  the  exchange. 


Another  method  of  applying  this  relaxation  technique  to  the 
solution  to  the  CMST  problem  is  to  restrict  arcs ;  i ,  e . ,  to  force  arcs 
to  be  either  "prohibited”  or  "required"  as  was  done  by  Chandy  and 
Russell  and  Elias  Ferguson.  Thus,  the  initiali2ation  and  subproblem 
solution  are  essentially  the  same  as  they  were  in  the  technique  des¬ 
cribed  in  the  previous  chapter,  but  the  method  of  generating  subpro¬ 
blems  is  different.  The  solution  order  is  still  a  depth  first  search. 

Some  differences  exist  in  the  initialization  procedure.  Instead  of 
forcing  a  node  into  a  subtree,  we  simply  "require"  the  arcs  (ij^,0), 
....  (i|^,0)  which  are  part  of  the  unconstrained  MST.  This  is,  of 
course,  equivalent  to  what  was  done  in  the  previous  case.  It  is 
implemented  in  a  slightly  different  way,  however. 

The  entire  procedure,  both  during  initialization  and  during 
subsequent  subproblem  generation,  restricts  itself  to  dealing  with 
established  arcs,  i.e.,  arcs  which  connect  a  node  directly  to  Vq  or  to 
other  nodes  connected  to  Vq  by  established  arcs.  Thus,  each 
"required"  arc  forces  a  node  into  a  given  subtree  and  each 
"prohibited"  arc  forces  a  node  out  of  a  given  subtree.  As  a  new 

subtree  is  encountered  (i.e _ ,  when  an  arc  of  the  form  (Vq,  v.)  is 

made  "required"),  we  simply  assign  the  next  available  subtree  number 
to  the  subtree. 

Subproblems  are  generated  by  successively  restricting  (requiring 
or  prohibiting)  established  arcs.  We  again  use  d  to  represent  the 
depth  of  the  search.  Here,  however,  d  refers  to  the  number  of 
forced  arcs  rather  than  the  number  of  forced  nodes.  Note  that  while 


the  number  of  required  arcs  is  limited  to  n,  the  number  of  prohibited 
arcs  is  not.  We  thus  have  a  different  type  of  decision  tree  than  we 
did  in  the  previous  section. 

When  forcing  nodes  into  subtrees,  we  dealt  with  a  tree  of  depth 
n  but  with  nodes  of  degree  sometimes  as  great  as  n.  Here,  we  deal 
with  a  binary  tree  of  depth  as  great  as  C^).  It  is  not  clear,  how¬ 
ever,  especially  with  the  paring  techniques  being  used,  which  deci¬ 
sion  tree  is  actually  larger. 

The  arc  chosen  for  inclusion  is, in  each  case  the  next  arc  to  be 
brought  in  by  Prim's  MST  Algorithm  (25);  i.e.,  the  shortest  arc 
connecting  a  node  to  some  node  connected  to  Vq  by  required  arcs. 
This  has  several  advantages: 

1.  The  arc  chosen,  if  excluded,  will  tend  to  raise  the  lower 
bound.  This  is  important  as  it  helps  control  the  size  of  the 
decision  tree.  Since  a  potentially  large  number  of  succes¬ 
sive  arc  exclusions  is  possible,  it  is  important  that  an  arc 
exclusion  result  in  an  increase  of  the  lower  bound  as  often 
as  possible  so  that  the  fathoming  process  will  limit  the 
depth  of  the  search. 

2.  If  the  arc  is  of  the  form  (v^,  v.),  it  need  only  be  consid¬ 
ered  as  "required"  and  not  "prohibited."  This  is  a  direct 
consequence  of  Theorem  1. 

3.  If  the  arc  (v.,  Vj)  is  prohibited,  and  hence,  v.  is  excluded 
from  the  subtree  containing  Vj,  then  so  are  all  arcs  of  the 
form  (v.,  Vj^),  where  Vj^  is  forced  into  the  same  subtree  as 
V..  This  is  a  direct  consequence  of  an  Elias  and  Ferguson 
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lils  of  the  remainder  of  the  implementation  of  the 


dure  as  they 
ibove . 


are  similar  to  the  node  restricting 
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V.  COMPUTATIONAL  EXPERIENCE 

The  procedures  described  in  the  previous  two  sections  were 
coded  in  FORTRAN  and  run  a  PDP-10.  In  this  secton,  we  discuss 
the  results  of  experiments  run  to  test  the  behavior  of  their  run  time 
and  effectiveness  as  a  function  of  problem  size  and  constraint  tight¬ 
ness. 

Problems  were  generated  by  reading  in  n  and  m  and  generating 
random  X  and  Y  coordinates  for  the  nodes  within  a  unit  square.  The 
location  of  the  center  was ,  in  various  problems ,  either  random , 
centered,  or  in  the  corner.  Euclidean  distances  were  used.  Most 
experiments  were  run  with  the  center  at  the  geographic  center  of  the 
unit  square;  in  this  way,  larger  problems  could  be  examined. 

Several  series  of  problems  were  run  with  identical  values  of  n 
and  m  (and,  of  course,  different  randomly  generated  points)  to  see 
how  stable  the  running  time  is  from  one  problem  to  another.  The 
standard  deviation  was  found  to  be  close  to  the  mean  for  the  problems 
run.  This  essentially  says  that  we  should  not  pay  close  attention  to 
exact  run  times  or  the  exact  number  of  subproblems  examined. 

Series  of  problems  were  run  varying  n  and  m  and  using  both  the 
node  restricting  and  arc  restricting  procedures.  Both  procedures 
were  run  with  the  identical  problems  and,  furthermore,  the  same  set 
of  nodes  was  used  (with  new  nodes  added  as  the  problem  size  grew) 
for  all  problems  in  this  series.  This  results  of  this  experiment  are 
shown  in  Table  1.  As  can  be  seen,  the  running  times  for  both 
procedures  were  comparable  and  run  time  grows  exponentially  with 
problem  size.  (It  was  gratifying  to  find  that  the  optimal  solution 
values  found  by  both  procedures  always  matched!) 
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It  has  already  been  mentioned  that  these  procedures  yield  lower 
bounds  which  are  at  least  as  great  as  those  obtainable  using  uncon¬ 
strained  MST's.  This  was  verified  empirically  by  actually  generating 
lower  bounds  with  MST's  as  the  algorithm  proceeded.  The  program 
was  to  print  any  exceptions,  i.e.,  any  times  where  the  MST  gener¬ 
ated  a  higher  lower  bound;  none  occurred.  Figure  3  shows  some 
typical  lower  bound  values  obtained  using  partitioning  and  MST's.  As 
can  be  seen,  not  only  are  the  partitioning  lower  bounds  greater,  but 
they  grow  more  quickly  with  depth.  This  is  significant,  as  a  linear 
increase  in  lower  bound  value  will  reduce  the  run  time  exponentially. 

To  measure  the  impact  of  the  difference  in  lower  bounds  between 
the  MST  and  partitioning  methods,  several  problems  were  run  first 
with  the  partitioning  method  and  then  with  the  MST  method  of  lower 
bounding.  The  results  of  this  experiment  are  shown  in  Table  2.  As 
can  be  seen,  the  partitioning  algorithm  examines  a  much  smaller 
number  of  subproblems,  and  apparently,  its  effectiveness  increases  as 
the  problems  grow  larger.  Thus,  although  it  is  somewhat  more  diffi¬ 
cult  to  evaluate  the  lower  bound  using  the  partitioning  algorithm  than 
it  is  using  an  MST,  it  is  less  than  n  times  as  hard  to  do  so.  The 
reduction  in  the  number  of  subproblems  which  must  be  examined 
appears  to  be  sufficiently  great  to  warrant  the  use  of  the  partitioning 
technique.  Indeed,  as  the  problem  size  grows,  its  attractiveness 


seems  to  increase. 


-22- 


VI.  SUMMARY  AND  CONCLUSIONS 

The  purpose  of  this  study  was  to  develop  improved  exact  tech¬ 
niques  for  the  solution  of  the  CMST  problem  (a  model  of  the  multidrop 
line  problem)  so  that  known  heuristics  for  the  solution  to  that  prob¬ 
lem  could  be  examined  on  a  broader  class  of  problems  and  so  that  new 
heuristics  could  be  developed  on  the  basis  of  what  was  learned. 
Much  of  this  happened.  An  improved  exact  technique,  based  upon 
generating  lower  bounds  using  partitioning  instead  of  MST's,  was 
developed  and  computational  experiments  were  run  using  it.  The 
bounds  yielded  by  these  techniques  were  tighter  than  those  yields  by 
the  MST  based  techniques,  and  hence,  the  number  of  subproblems 
which  had  to  be  examined  in  order  to  obtain  a  solution  was  smaller. 
Indeed,  the  decrease  in  the  number  of  subproblems  examined  more 
than  compensated  for  the  increased  effort  required  for  the  examina¬ 
tion  of  each  subproblem.  Thus,  the  new  techniques  served  their 
purpose  in  that  they  permitted  the  examination  of  problems  not  care¬ 
fully  examined  before.  In  particular,  it  was  possible  to  examine 
problems  with  very  tight  constraints,  although  it  was  not  possible  to 
examine  problems  of  substantially  greater  size  than  had  been  pre¬ 
viously  examined. 

Even  with  the  improved  technique,  the  growth  of  run  time  with 
respect  to  problem  size  was  found  to  be  exponential,  albeit  of  a  lower 
order  than  previously  know  exact  techniques  and  of  a  much  lower 
order  than  the  solution  space.  Thus,  one  cannot  use  the  technique 
for  large  problems.  A  number  of  acceleration  techniques  were  devel¬ 
oped  and  incorporated  into  the  procedure. 
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Thus,  it  was  possible  to  examine  sufficiently  interesting  problems 
using  the  exact  technique  to  make  several  insights  into  the  problem. 
The  first  is  that  the  performance  of  the  known  heuristic  degrades  as 
the  constraint  tightness  increases  and  improves  as  the  problem  size 
increases.  An  imporved  heuristic  [33]  was  also  developed  on  the 


basis  of  this  study. 
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Table  1:  Number  of  Subproblems  Examined 


n  m  Node  Restricting  Arc  Restrictin 


8 

2 

16 

34 

8 

3 

13 

24 

10 

2 

120 

199 

10 

3 

57 

68 

10 

4 

67 

73 

12 

2 

298 

696 

12 

3 

375 

362 

12 

4 

171 

138 

14 

2 

766 

723 

14 

4 

526 

379 

16 

3 

not  run 

1085 

16 

4 

818 

737 

18 

3 

not  run 

6832 
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TABLE  2:  COMPARISON  OF  NUMBER  OF  SUBPROBLl'.MS  EXAMINED 
USING  MST  AND  PARTITIONING  AS  LOWER  BOUNDS 


n 

m 

NUMBER  OF 
SUBPROBLEMS 
(MST) 

NUMBER  or 
SUBPROBLEMS 
(PARTITIONING) 

8 

3 

87 

24 

12 

3 

2,767 

362 

20 

7 

4,205 

146 
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2200  - 
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FIGURE  3:  COMPARISON  OF  LOWER  BOUND  VALUES  USING  MST  AND 

INTERSECTION 


METVORX  DESIGM  'WITH  A.S  OBJECTIVE  FUNCTION  IMCLUCING  A  FIXED  tKARCE  (Revised) 


A.  K«t«heab«ua* 

Felytcchnle  lascitute  of  New  York 


I.  Intreducciott 

There  are  easy  accwork  deiiia  probiea*  ubieb 
would  be  eaaily  solvable  escape  for  cbe  presence 
of  a  fised  ebarte  in  cbe  cost  of  each  facillcy. 
The  preseoce  of  the  fised  cbarie  however,  which 
caeaoe  be  ifeered  uichouc  destroyiat  the  validity 
of  the  aodel,  eakes  these  probieae  euch  oore 
difficult  to  solve.  We  befio  by  preseotiog 
several  such  problesis  and  i  general  technique 
which  say  be  of  use  ia  obcaiaiog  soluttoos  to  all 
of  thea.  We  then  focus  on  ooe  such  problen  aod 
actually  apply  the  technique  to  its  seiutioa. 

II.  Probiea  Stoteaents 

The  aoec  basic  probiea,  and  the  ooe  which  we 
will  ultiaately  focus  on,  ia  that  of  locating 
earth  stations  ia  a  satellite  network.  We  are 
given  a  fised  set  of  N  terainal  locations  and  a 
set  of  .1  potential  earth  station  location  sites. 
Tetniaals  coaaunicate  with  one  another  via  the 
satellite  and  they  reach  the  satellite  via  direct 
terrestial  connection  to  an  earth  station  (sea 
Figure  1).  The  cost  of  ceoacctiag  teruiaal  t  to 
earth  scacioa  j  is  given  by  c, , .  The  cost  of  an 

earth  station  at  the  j  potential  sice  is  de~ 
noted  by  dj .  The  cost  of  a  satallite  link'  (er  of 

the  satellite  itself)  is  is  asnuned  to  'oe  dis¬ 
tance  isdependnnt  and  linearly  related  to  its 
capacity.  Thus,  tbs  cost  of  the  aataliita  links 
is  net  affected  by  the  selection  of  earth  sta¬ 
tions  and  era  tnerefors  'jnorac.  assuae 

intcially  taat  tnere  are  oo  capacity  constraints 
on  the  earth  stationa. 

The  probiea  daacribad  above  la  a  wall- 
known  uacapacitatod  facility  location  probiea 
(tirtP): 

N  n 

ninialse  Z  »  I  Z  C. ,  E. ,♦  Id.Y 
i«l  j«i  W  iJ  j  -  J 

*  This  work  was  lupportad  in  part  by  tha  US  .iray 
Coradeoa  under  Grant  CAAK-8C-gO-X-OS79  and  ay 
tha  National  Science  Foundacioa  under  Grant 
SMC- 7908 I 20 


Figure  1.  NETdORX  HOOEi 
fubjacc  to: 

Z  X.  »  I  far  all  i 

J 

X^j  <  Tj  for  all  i  and  j 

where  X^^  cerraapobda  to  the  fraction  at 

tomintl  i  asaigned  to  facility  j  acy  y;  la 
I  if  facility  J  is  locluoad  ia  tha  solution. 


Ihis  problea  bai  beea  nucb  studied  [l-10|.  The 
book  by  Hsadler  sad  .'lirchaadaai.  [5]  is  a  coapea- 
diuB  ot  related  probleas  and  solutioas.  The 
procedure  oi  Erlenkotter  [3|  has  beea  showa  to  be 
successful  ia  obtaiaiaf  optiaai  solutioas  to  tsaay 
significaat  uocapacitated  facility  locacioa 
probleas.  The  procedure  «e  will  describe  la  the 
followiag  sectioa  is  equivaleat  to  Irleakotter' s 
procedure  ia  the  case  of  the  uacapacitated  facil* 
ity  locatioa  problea  but  differs  froa  his  ia  that 
it  is  based  directly  ia  coobiaacorics  (rather 
then  ia  a  liaear  prograoaiog  dual)  aad  is  direc¬ 
tly  esteaeible  to  aaay  other  probleas  described 
below,  aaoag  thea  the  capacitated  facility  loca- 
tioo  problea. 

Ue  aote  that  the  UFLP  reduces  to  a  trivial 
problea  where  each  teiaiaal  is  cooaected  to  its 
oeatest  (least-cost  cooacctioa)  earth  statioo  if 
we  Ignore  the  fised  cost  of  the  earth  statioas. 
This  is  the  property  which  uaifies  all  the  prob- 
leas  we  will  exaaiac. 

There  are  easy  exteaaioaa  to  the  tlFIf  which 
are  also  part  of  this  class  of  probleas.  If  a 
capacity  coaatraint  ia  iaposed  oa  the  earth 
statioas,  the  problea  reduces  to  aa  aaaigaaeot 
problea  if  the  cost  of  the  earth  statioas  is 
ignored  and  we  assuae  either  that  the  capacity 
conatrainc  la  oa  the  ouaber  of  terainals  or  we 
allow  a  teraiaal  to  be  associated  with  acre  than 
one  earth  statioo  and  the  cost  of  coonection  is 
linear  with  traffic. 


lowing  traasforaalioa.  A  bogus  poteatial  earth 
station  locatioa,  e_^, ,  is  introduced  with  d  =0 

and  C.  ,oV.  for  iai,...N.  The  corresponding 

capacitated  or  uacapacitated  problea  is  then 
solved  to  find  the  optiaai  solutioa  including  all 
terainals.  The  bogus  earth  station  and  all 
terainals  associated  with  it  ace  then  reaoved 
froa  the  optiaai  solutioa  to  yield  an  optiaai 
solutioa  to  the  problea  with  a  choice  of  which 
terainals  to  Include  [&].  Thus,  we  will  liait 
coosiderstions  to  the  fised  teraiaal  probleas  in 
the  sequel. 

Another,  closely  related,  problea  is  that  of 
locating  cepeaters  in  a  radio  based  network. 
Here  we  assuae  we  are  given  a  sec  of  .V  teraiaal 
locations  and  H  potential  repeater  locations. 
Terainals  coanunicate  via  radio  links  to  the 
repeaters.  The  repeaters  coanunicate  directly 
with  one  another,  again  via  radio  links,  relaying 
aesseges  between  terainals.  A  terainal  can  or 
cannot  coanunicate  with  a  repeater  depending  on 
Che  distance  between  then  and  the  nature  of  the 
intervening  terrain.  Thus,  we  have  a  UFL?  with 
all  c^je{0,  a).  As  we  will  tee,  the  nature  of 

Che  cost  natrix  nakes  this  preblcn  different  fron 
the  previously  stated  UFLP't  in  several  inportant 
ways  which  will  alter  the  effectiveness  of  the 
proposed  solution  techniques. 

II.  Basic  Procedure 


Another  problea  is  that  of  designing  a 
network  as  above  where  not  all  teminals  need  be 
included.  In  this  version  of  the  problen,  there 
is  a  value,  V^,  associated  with  including  cemi- 

nal  i  in  the  network  and  we  seek  to  nasiniss  the 
profit,  P,  where  P  is  given  by 
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subject  to 


We  now  present  a  general  procedure  for  the 
solution  of  sll  of  the  above  probleas.  For  the 
sake  of  clarity,  we  describe  application  of  the 
procedure  to  the  solutioa  of  the  UFL?.  Siailar 
procedures  have  beea  proposed  by  Held  &  Karp  [b], 
aad  Caaeriai  [1]  for  the  solution  of  a  broad 
class  of  probleas.  Those  procedures  have  been 
shown  Co  be  effective  ia  soae  cases  aad  ineffec¬ 
tive  ia  others.  One  of  our  objectives  is  to  shed 
light  oa  inherent  differences  anong  probleas 
which  differeaciace  Chen  in  terns  of  the  effec¬ 
tiveness  of  these  procedures  and  others  like 
Chen. 


M 

XX..  e  V.  for  sll  1 
jal  ^ 

*ij  -  ’^i  - 


Let  •  penalty  added 

Co  the  cost,  C^j,  of  ccnaectiag  teraiaal  i  to 
earth  station  j.  If  P^j  >  0  for  all  i  and  j  and 


*lj’  “i*  ^j  i  aad  j 

A  capacitated  version  of  this  problen  exists 
by  adding  the  constraincs 

X  Z. .  t.  <  S,  for  all  j 
^  *  J 

where  C^  is  the  traffic  associated  with 
temiaal  1  and  Sj  ia  the  capacity  of  earth  ata- 
cion  j . 

This  problen  can  be  reduced  to  the  preceding 
one,  with  fixed  teminal  locations,  by  the  fol¬ 


I  ’  -J 


Chen  the  solution  to  Che  UFLP  using  in  place 
of  aad  i*  *  relaxation  of  the  original 
UFL?  for  any  satisfying  the  above  con¬ 
straints.  Here  the  facility  costs  are  discrib- 
uced  over  the  teminals.  The  solution  ta  the 
problen  with  d.vO  it  trivially  obtained  by  se¬ 
lecting  the  nininun  in  each  row.  We  note 

that  the  solution  value  thus  obtained  is  indeed  a 
lower  bound  to  a  feasible  solutioa  since 


% 


■9 


z 

UI. 
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where  I .  Is  the  set  of  ceraiaals  associated  with 

J 

earth  statioa  J  in  the  solutioa  to  the  relaxed 
probleiB. 


Ideally,  we  would  choose  the  P^j  to  aasiaiee 

the  value  of  the  relaxed  optioua  aad  thus  provide 
as  tight  a  bound  as  possible,  i.e.,  solve: 

Maxiaize  over  P^^. 

[2  0  I  -in  >P,j)l 


This  problea  has  been  successfully  approached 
using  subgradient  optiaization  techniques  (6,10]. 
For  the  aosMnt,  however,  we  defer  further  discue- 

sion  of  how  beat  to  set  the  P..  and  note  that 

tj 

relatively  siaplc  heuristics  often  suffice  in 
aany  cases  while  in  other  eases  even  the  optiaal 
P^j  (i.e.  those  aaxiaizing  the  lower  hound)  leave 

a  lower  bound  significantly  lower  than  the  opti* 
aal  solutioa  to  the  uarelaxed  problea. 


For  the  specific  ease  of  the  UFLP,  a  linear 
prograaaing  problea  can  be  solved  to  find  the 
optiaMl  P^j : 

Haxiaize  2  e  Z  V. 

i  * 


Cij  *  -  '^i  ^  J 

2  P. .  ■  d,  for  all  j 
i  J 


Pij  >  0 

This  is  the  basis  for  Erlenkotter' s  procedure 
but,  again,  he  points  out  that  often  a  siaple 
heuristic  suffices  to  set  the  P, . . 

ij 

The  solutioa  to  the  relaxed  problea  aad  the 
values  of  (slack)  variables 


S.  *  I  MAX  (0,  V.  -  C. .) 
J  ^  J  tj 


can  be  used  as  the  basis  for  a  branch  aad  bound 
procedure  to  find  the  optiaua  UTLP  solution. 
Specifically,  we  begin  by  setting  the  P^j  and 

solving  the  relaxed  problea.  We  then  obtain  a 
feasible  solutioa  to  the  original  problea  by 
selecting  the  earth  stations,  j,  for  which  S^vO. 

These  correspond  to  earth  stations  which  justify 
their  cost  by  virtue  of  the  savings  obtained  for 
temiaals  associated  with  thea.  To  see  this, 
note  that,  during  the  procedure,* we  try  to  aake 
(or  equivalently,  the  aiaiania  in  row  i)  as 

large  as  possible  in  order  to  sake  the  lower 


bound  as  large  (tight^  as  possible.  Thus, 
aust  equal  0  for  soae  j  with  C  <  V.;  or  else  we 

ij  -  J 

would  aake  larger. 

Let  J*s{j|S,sO}.  If,  for  all  i,  C.,  >  V. 
*  J  *  “  J 

for  all  jeJ  except  for  a  single  j  tl)e  bound  is 
tight.  If  for  two  or  wore  JCJ  ,  there  is 

a  gap  between  the  lower  bound  and  the  feasible 
solution.  Erlenkotter  suggests  a  procedure  for 
adjusting  the  to  soaetiacs  eliainate  such 

situaciona.  If  a  gap  reasins,  one  such  j  is 
selected  and  alternately  forced  in  and  out  of  the 
solution.  The  procedure  it  continued  with  the 
newly  foraed  tubprobleas  in  place  of  the  problea 
which  spawned  thea.  A  subproblea  is  said  to 
fathoa,  and  bence  it  eliainated  froa  further 
consideration,  when  its  lower  bound  equals  or- 
exceeds  the  currently  best  feasible  solution. 

A  siailar  procedure  can  be  defined  for  the 
capacitated  facility  location  problea.  Again  the 
fixed  coats ,  ,  are  replaced  by  penalties ,  P^j , 

added  to  the  C^j  satisfying  P^j  >  0  and 

Z^  P^^  -  ^j '  ^  assignaent  problea  it  then 

solved  to  obtain  a  lower  bound  on  the  original 
problea.  Heuristics  for  setting  the  P.  .  aad 
obtaining  a  feasible  solution  are  lets  oovious 
here  aad  very  little  work  has  previously  been 
done  on  these  probleas. 


III.  A  More  General  Procedure 

We  now  describe  the  procedure  in  a  aore 
general  context.  Define  a  aatroid,  M,  to  be  a 
2>cuple  (E,F)  where  E  is  a  finite  set  of  eleaeats 
(e^ I jsl, ' ' 'H)  aad  F  is  a  faaily  of  independent 

subsets  of  E.  Our  notion  of  independence  is  very 
general.  We  require  only  that  independent  sets, 
I,  satisfy: 

1.  If  I  t  F  and  I'  c  I  then  I'  e  F;  i.e.,  all 
subsets  of  an  independent  set  are  inde¬ 
pendent. 


Given  cuo  ladepeadenc  deds,  I  and  I  ,, 

? 

coocaininf  p  and  p  1  eleacnts  respec- 
civcly,  there  exists  soac  e  cl  ,,  e.Cl 

j  ?*1'  j  p 

such  that  I  (e  jcF.  That  is,  uc  can 
always  find  ele^nt  in  the  larger  inde- 
peodenc  sec  (not  already  in  the  smaller  onei 
which  can  be  added  to  the  smaller  indepen* 
deac  sec  without  destroying  independence. 


ever,  using  the  oatroid  properties  that  the 
algorithm  (']  solves  the  proolem. 

Greedy  Alsorithm 
e 

Step  0:  I  0 

i  0 

Rcindex  Che  e.  so  that  w.  >  w.  for 
j  <  k  J  J  -  * 


The  book  by  Welsh  [9]  is  an  excellent  treat¬ 
ment  of  aatcoid  theory  and  the  book  by  Lawler  [7] 
gives  applications  of  aatroid  theory  to  the 
solution  of  combinatorial  optimization  problems 
including  much  of  the  discussion  in  this  section. 
Caaerini  [l]  gives  much  of  the  rest. 


Step  1 :  i  i  ♦  1  ^ 

If  i  >  M,  stop;  l'  is  the  optimal 
set 

Step  2:  If  I*  {e.}  e  F  set  I*  *•  I*  {e^} 

Return  to  itep  1 


There  arc  many  types  of  matroids.  Three 
types  are  of  particular  interest  to  us: 

1.  Hatric  Matroids  -  Let  E  be  the  set  of  col¬ 
umns  of  a  matrix  and  F  the  family  of  sets  of 
coluams  which  arc  linearly  in  dependent. 

Z.  Graphic  Matroids  -  Let  C  be  the  set  of  edges 
of  an  undirected  graph  and  F  be  the  family 
of  edge  sets  that  contain  no  cycles  (i.c., 
members  of  F  are  ::reas  and  forests). 

3.  Partition  Matroids  -  Let  C  be  an  arbitrary 
set  and  i  »  1,*"K  be  a  partition  of  E; 

i.e.,  ^  ^  *i  Bj  m  d  for  i  P  j. 

Let  F  be  the  family  of  subsets  of  E  con¬ 
taining  no  more  than  A,  elcawnts  from  B^, 
i.e., 

I  e  F  iff  II  Bj^l  <  for  all  i 

It  is  fairly  easy  to  see  these  three  notions 
of  independence  satisfy  the  two  rc<iuired  proper¬ 
ties  and  thus  that  the  three  2-tuples  are  indeed 
matroids . 

Matroids  are  of  interest  in  the  context  of 
the  solution  of  combinatorial  optimization  prob¬ 
lems  because  of  the  existence  of  very  efficient 
procedures  for  the  solution  of  problems  which  can 
be  shown  to  be  equivalent  to  finding  the  "best"' 
independent  set  in  a  matroid. 

Specifically,  suppose  a  weight,  w^,  is 

associated  with  each  element  c.tE  for  soac 

J  ★ 

aatroid  (E,  F) .  We  may  wish  to  find  the  set  I  eF 
satisfying 

W(I*)  *  MAX  (W(I)) 

leF 


where  W(I)  a  £  w 
jel  J 

Without  the  structure  imposed  by  the  matroid 
properties,  the  solution  of  this  problem  could 
entail  examining  all  IcF,  potentially  on  the 
v 

order  of  2‘  subsets  if  lEI^N.  We  can  prove,  how- 


Thus,  we  need  only  sort  the  elements  of  E 
and  examine  each  element  once,  testing  for  inde¬ 
pendence  with  the  previously  chosen  eleswnts. 
Assuming  the  test  for  independence  is  not  too 
difficult  (and  it  is  quite  simple  in  the  three 
cases  above) ,  the  overall  procedure  is  of  the 
order  of  a  polynomial  in  N.  Thus,  large  problems 
can  be  solved. 

Furthermore,  even  if  w^  is  replaced  by  any 
monotone  (increasing)  f(w^)  the  greedy  algoritba 

will  still  find  an  optiaial  solution;  chat  is,  the 
optisMl  solution  found  using  w^  above  is  alto 

optimal  for  any  monotone  f(w^).  Thus,  f  need 
only  be  evaluated  for  the  e.  actually  chosen. 

Sometimes  we  wish  to  find  an  independent  set 
of  minimum  total  weight  subject  to  the  restric¬ 
tion  chat  it  also  be  of  maximum  cardinality.  An 
example  of  this  is  a  minimal  spanning  tree  (i.e. 
finding  a  tree  spanning  all  nodes  and  of  minimum 
total  length) .  The  greedy  algorithm  can  be  used 
to  solve  this  problem  by  setting  w^  o  A  - 

where  the  are  edges  of  the  given  graph,  Z^  is 
the  length  of  e^ ,  and  ^  ^  J ■ 

Unfortunately,  many  problems  cannot  be 
phrased  as  finding  maximum  weight  independent 
sets  in  a  isacroid  because  the  sets  satisfying  the 
constraints  do  not  coofocm  to  the  two  properties 
given  above.  One  vay,  however,  be  able  to  phrase 
the  problem  as  the  intersection  of  two  or  more 
matroids  as  follows. 

The  intersection  of  two  matroids,  M.jV(E,  F^) 
and  M2v(E,  F2),  defined  over  the  same  sec  of 

elements  but  with  two  different  notions  of  inde¬ 
pendence,  is  the  2-euple  (E,F)  where  IcF  iff 
(IcF,  and  leF^)-  Unfortunately,  the  intersection 
of  two  matroids  Is  not  a  matroid  and  thus  many 
properties  and  algorithms  do  not  carry  over. 
There  is,  however,  an  efficient  algorithm  Ic.f. 
kowler)  for  finding  the  maximum  weight  set  IcF 
for  an  intersection  of  two  matroids. 


Colt  Matrix 


The  intecoectioa  i  aatroids  cia  stailarly 
be  defined  is  a  Z-tuple  (E,F)  where  IcF  iff  IcF. 

J 

for  all  jal,2,'''K.  There  is  no  known  aiioritlia 
with  a  runnini  tiac  polynoaial  in  the  niiaber  of 
claacnts  in  E  to  find  the  aaxiaua  wei(ht  IcF  for 
i02.  Indeed,  the  problea  for  K  >  3  has  been 
shown  to  be  SIP'coapiete.  It  has  also  been  shown 
chat  Che  probleas  for  K>3  are  esscncially  cquiva* 
lent  to  probleat  for  .<>3,  i.e.,  it  is  only 
sli(hcly  acre  difficult  to  solve  probleas  for 
(cncral  K  than  it  is  to  solve  probleas  for  Ks>3. 

Ue  now  turn  to  the  detailed  solution  of  one 
of  the  specific  probleas  described  in  the  preced¬ 
ing  sections. 

IV.  The  Repeater  Location  Problea 

The  techniques  described  in  the  preceding 
sections  is  applied  to  the  solution  of  the  re¬ 
peater  location  problea  in  a  broadcast  rsdio 
network.  In  this  problea,  we  are  given  a  sat  of 
X  terainal  locations,  t^,  a  tec  of  H  potential 

radio  repeaters,  r ^ ,  and  a  coverage  aaCrix 
defined  by: 

1  if  t.  can  coMunicace  with  r, 

■*  0  otherwise. 

A  feasible  network  design  corresponds  to  a  selec¬ 
tion  of  repeaters  which  "cover''  all  the  teminals 
in  the  sense  that  each  ccminal  tan  coeounicate 
with  at  least  one  repeater.  (Ve  assusM  the  re¬ 
peaters  can  always  coasninicste  with  one  another.) 
An  optisul  solution  is  a  feasible  set  of  repeat¬ 
ers  of  aittisnia  cardinality.  In  this  siaplc  fora 
of  the  problea,  all  repeaters  are  assuaed  to  have 
identical  cost  (say,  1)  and  unliaiccd  capacity. 
Thus,  the  problea  is; 

H 

tUniaize  2*2  T, 
jal  J 

9 

s.t.  X  C. ,  T,  >  1  for  i  *  I,  •H 
j*l  ‘■J  J  ■ 

and  Tj  a  0  or  I  for  j  *  l,'"S 

The  variables  correspond  to  selecting  the 
repeater  in  potential  location  j  or  not 

selecting  it  (T^aO).  This  is,  of  course,  a  set 
covering  problea  (c.f.  Garfinkle  and  Xeahauser). 

We  could  also  considered  this  problea  as  an 
uacapacitated  facility  location  problea  (UFLF). 
In  tUs  latter  case,  the  coverage  aatrix  corre¬ 
sponds  to  the  aatrix  of  costs  of  connecting 
location  i  to  facility  j  in  the  UTLF  as  follows : 


Coverase  .latrix 


C.  .  =  l 

<3> 

c. . 

IJ 

ij 

c.  .*0 

<X> 

c. . 

i: 

One  '  could  thus  consider  solving  the  repeater 
location  (XL?)  problea  at  a  UFLP.  In  practice 
this  turns  out  to  be  a  bad  idea  because  of  the 
tynaetry  of  the  RIP.  In  particular,  all  repeat¬ 
ers  have  identical  costs  and  all  tcrainals  con¬ 
nect  with  identical  cost  to  repeaters  which  cover 
thea.  An  algoritha  for  the  general  UFL?  would 
have  to  wade  through  a  large  nuaber  of  alternate 
optiaa.  We  can  eliainate  a  large  part  of  this 
problea  and  in  fact  use  the  svaawery  to  advantage 
by  applying  the  following  doainance  rules  (c.f. 
(iarflnlde  and  Xeahauser)  to  eliainate  potential 
solutions : 

Ut  ■  {j  I  c.j  *  1} 

and  T^  *  {i  I  c^^  »  1) 

Thus,  is  the  set  of  repeaters  which  can  cover 

terainal  1  and  T.  is  the  set  of  terainals  covered 
by  repeater  j .  ^ 

Xule  1;  If  then  t^  doalnates  t^  and  t^ 

caa  be  eliainated  froa  the  problea. 

Rule  2:  If  T^  then  r^  ioainates  r.^  and  r^^ 

can  be  eliainated  free  tbe  problea. 

'  It  is  easy  to  see  that  it  suffices  to  con¬ 
sider  only  undoainated  terainals  and  repeaters  in 
seeking  an  optiaal  solution  to  the  RIP  since  any 
solution  containing  a  doainated  repeater  could 
have  included  an  undoainated  repeater  instead  and 
any  solution  which  covers  an  undoainated  terainal 
also  covers  all  terainals  doainated  by  it. 

To  get  the  full  benefit  of  doainance  aaong 
terainals  and  repeaters,  rules  1  and  2  should  be 
applied  repeatedly  until  no  further  reduction  in 
the  problea  can  be  effected,  because  it  nay  be 
that  r^  doainates  r^  after  soae  doainated  terai- 

nals  are  reaoved  froa  conaideration.  Further- 
aore,  in  the  course  of  a  branch  and  bound  pro¬ 
cedure,  where  repeaters  are  forced  into  and  out 
of  the  solution  to  define  disjoint  subprobleas, 
rules  I  and  2  nay  be  successfully  applied  to  find 
further  doainance  in  subprobleas. 

A  version  of  the  above  procedure  was  coded 
up  and  run  on  a  variety  of  probleas.  The  results 
of  these  runs  are  shown  in  Tables  I  &  II.  .is  can 
be  seen,  the  procedure  is  quite  effective  for 
aany  probleas  even  though  only  relatively  naive 
branching  and  bounding  heuristics  were  used.  In 
particular,  the  branching  rule  used  was  to  find 
the  repeater  which  covers  the  largest  nusber  of 
currently  uncovered  and  undoainated  terainals  and 
alternatively  forcing  it  in  and  out  of  the  solu¬ 
tion.  The  lower  bound,  was  generated  for 

each  subproblea  using  the^following  procedure. 


St«P  3: 


Z.,  =  aumber  of  repeaters  forced  lata 
tbe  soLutioa  ia  tbis  tubproblea 


„  ,  -  “  '•ij  -  ‘ 

‘j  -  if  Clj  *  ® 

For  each  repcacor,  r ^ ,  Let  Hj  a  tbo 

auaber  of  uncovered  and  undoainnced 
tnrainala  covered  by  r^.  Also,  let 

S^ai.  Sj  ia  Che  re«aiaint  weight  to  be 
aaaigned  in  coluau  J  of  the  cost  a«trln 

Find  Che  coluan  (repeater) ,  K,  with 
■arina  ^^7  tonnider  J  such 

Chet  >  0.  Let  b  ■  Sj,/irj, 

For  i  a  1,  M 
[For  j  a  I,  M 


TABL£  II  (Ranoon  Coverage) 


a  1  and  C, 


a.j-l 

aSj-b 


Step  A:  If  there  ia  any  J  with  Mj>0,  retun  Co 
Step  2.  Otherwiae  go  to  Step  S 

S££E-i*  ^  ^  *  I  •J® 

.Vote  that  Che  above  procedure  siaply  aaaigna 
b  to  all  eleaenta  in  a  tow  at  each  atep  and  thia 
ia  of  the  general  fora  deacribed  above. 

Rcaearch  is  currently  ia  progreaa  to  idea* 
tify  aere  effective  branching  and  bounding  rvlea. 
Vo  see  (aee  Table  0  that  for  probleaa  with 
geographic  coverage,  i.e.  whan  the  valuea  of 

are  baaed  on  the  diatance  between  t^  and  r^,  Che 

procedure  worka  very  well.  Thia  ia  becauae  the 
doainance  is  very  effective  in  these  cases.  For 
randoa  coverage,  however,  the  doainance  ia  leas 
effective.  The  overall  procedure  is  nonetheless 
sufficiently  effective  to  solve  probleaa  with  SO 
locations. 
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ABSTRACT 

An  algorithm  Is  described  for  designing  cir¬ 
cuit  switched  networks.  In  contrast  to  previous 
work,  the  algorithm  considers  both  voice  and  data 
traffic.  The  network  design  is  cost  efficient. 

It  provides  topological  design  as  well  as  routing 
doctrine.  Performance  analysis  for  both  voice  and 
data  traffic  Is  carried  out. 

INTRODUCTION 

Circuit  switching  Is  a  technology  which  hat 
long  been  employed  In  voice  consnunlcatlon  networks. 
In  such  networks,  prior  to  any  communication  tak¬ 
ing  place  between  two  users,  a  connecting  route  Is 
established  between  them.  A  sequence  of  channels 
or  trunks  Is  reserved  for  their  use.  The  route  is 
dedicated  to  their  consnunlcatlon  for  Its  duration. 
If  a  route  cannot  be  reserved,  due  to  traffic  con¬ 
gestion,  the  user  Initiating  the  request  for  con¬ 
nection  Is  "turned  away."  His  call  Is  "lost." 
Circuit  switched  networks  are  thus  characterised 
by  an  overhead  delay  In  the  establishment  of  the 
connecting  route.  However,  they  are  also  charac¬ 
terised  by  a  low  delay  during  the  actual  period  of 
communication.  This  Is  due  to  the  absence  of  any 
contention  after  a  route  has  been  set-up.  These 
are  conditions  well-suited  to  voice  coemiunlcatlona 
The  duration  of  the  typical  telephone  conversation 
Is  normally  much  greater  than  the  time  for  route 
set-up.  The  overhead  can  be  easily  tolerated. 

The  present  paper  provides  an  algorithm  for 
designing  circuit  switched  consnunlcatlon  networks. 
In  contrast  to  previous  work,  attention  Is  direc¬ 
ted  to  combined  voice  *  data  traffic,  rather  than 
to  voice  traffic  alone.  There  is  great  Interest 
In  designing  networks  for  such  Integrated  communi¬ 
cations.  Traffic  volijnes  are  much  higher  than  In 
separate  networks.  Consequently,  savings  can  be 
achieved  through  economy  of  scale. 

In  most  Integrated  communlcatlona  voice  traf¬ 
fic  (at  least  at  the  outset)  is  much  higher  than 
data  traffic.  Circuit  switching  provides  a  reas¬ 
onable  connection  technologv.  This  is  the  net¬ 
working  alternative,  which  from  the  delav  point  of 
view,  la  most  well-matched  to  the  dominant  traffic 
element.  If  data  traffic  grows  relative  to  voice 
traffic,  arguments  can  well  be  made  for  using  ot¬ 
her  methods  of  connection,  such  as  packet  switch¬ 
ing  or  hybrid  swltcblhq.  However,  in  a  combined 
environment  voice  traffic  will  always  be  signifi¬ 
cant.  Circuit  switched  networks,  with  their  mat¬ 


ure  switching  technology,  will  always  be  attract Ive. 

The  algorithm  presented  is  unified.  It  pro¬ 
vides  a  circuit  switched  network  design  for  a  pre¬ 
specified  end-to-end  performance  criterion  --  the 
probability  that  a  call  Is  tost.  The  design  spec¬ 
ifies  the  network  topology  --  the  connectivity  of 
network  nodes  and  the  dimension  of  links  In  terms 
of  number  of  trunks.  A  routing  doctrine  Is  speci¬ 
fied  with  both  primary  and  alternate  routes.  The 
average  cross-network  delay  performance  for  data 
traffic  Is  provided.  Network  cost  Is  given,  hav¬ 
ing  been  computed  from  both  link  and  node  require¬ 
ments.  It  should  be  emphasized  chat  only  the  back¬ 
bone  network  Is  considered.  Local  traffic  la  aa- 
siised  to  be  concentrated  at  the  backbone  nodes  us¬ 
ing  standard  teleprocessing  techniques  such  as  mul¬ 
tiplexing  and  concentration.  The  authors  have  drawn 
from  a  wealth  of  previous  work  In  the  area  of  cir¬ 
cuit  switching.  Specific  acknowledgement  Is  paid 
to  the  work  of  Katz  and  Knepley  and  tJj  .  How¬ 
ever,  It  Is  believed  that  this  Is  the  first  algor¬ 
ithm  which  considers  both  voice  and  data  traffic 
carried  on  the  same  circuit  switched  network.  Fur¬ 
thermore,  the  voice  and  data  traffic  are  not  multi¬ 
plexed  on  the  same  channels,  (In  a  TASI  type  arran- 
pwnt),  rather  they  contend  for  available  link  cap¬ 
acity. 

DRSCRIFTION  OF  THE  ALGORITHM 
High  Level  Description 

Before  proceeding  with  a  detailed  description 
of  the  design  algorithm  It  wnild  be  worthwhile  to 
take  a  look  at  a  high  level  block  diagram.  Illus¬ 
trated  In  Figure  1.  The  major  functional  sc^nents 
of  the  design  procedure  are  shown  here.  The  PRE¬ 
LIMINARY  SEQIENT  provides  the  necessary  design  In¬ 
puts,  c.g.  node  locations,  traffic,  desired  perfor¬ 
mance,  etc.  It  also  '^nassages"  the  Inputs,  putting 
them  In  the  form  necessary  for  later  algorithmic 
procedures.  Ilie  TOPOLOGICAL  DESIGN  SEGMENT  carries 
out  the  optimization  of  the  network  design.  It  pro¬ 
vides  a  cost  effective  connectivity  of  the  nodes 
for  the  desired  traffic.  It  also  dimensions  the 
network  links  to  meet  the  desired  performance  goal. 
The  ROUTING  SEOIENT  gives  the  doctrine  to  be  used 
on  this  topological  design.  It  defines  the  routes 
through  the  network  used  tc  connect  desired  end- 
user  nodes.  The  block  labelled  NETWORK  SPECIFICA¬ 
TIONS  sunnarlzes  the  network  design  up  to  the  point 
In  the  algorithm;  that  Is,  the  topology,  trunk  di¬ 
mensions  and  routing  doctrine.  The  VOICE  PERFOR¬ 
MANCE  ANALYSIS  SEGMENT  analyzes  the  performance  of 
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the  network  stored  in  the  riETUORK  SPECIFICATION 
with  respect  t^  voice  traffic.  I.  estimates  the 
averai;e  probability  of  a  lost  ca^l  for  the  given 
input  traffic.  This  is  the  probabilitv  that  a 
voice  coninunicator  desiring  connection  to  a  remote 
node  is  turned  away.  Nothing  is  done  if  the  aver¬ 
age  probabilitv  of  a  lost  call  is  within  a  given 
tolerance  of  the  goal  in  the  PRELIMINARY  SEOfErCT. 
Otherwise,  instructed  adjustments  are  fed  back  to 
the  NETWORK  SPECIFICATION.  The  topological  design 
Is  not  changed.  However,  the  link  dimensions  in 
terms  of  trunks  are  changed  to  bring  the  measured 
average  probability  of  a  lost  call  close  to  the 
desired  goal.  The  DATA  PERFORMANCE  ANALYSIS  SEG¬ 
MENT  measures  the  average  cross  network  delay  for 
data  traffic  carried  in  the  resulting  network  sto¬ 
red  in  the  NETWORK  SPECIFICATION  block.  The  de¬ 
sired  goal  for  delay  does  not  impact  network  de¬ 
sign.  This  block  is  merely  a  post-processor.  The 
COSTING  SEOiENT  block  computes  Che  transmission 
and  switch  cost  for  the  resulting  network.  Finall)( 
the  OUTPUTS  block  summarizes  the  circuit  switched 
network  design. 


High  Level  Block  Diagram  of  Algorithm 


Detailed  Description 

A  detailed  description  of  each  porC<.on  of  the 
algorithm  structured  In  Fig.  1  will  now  be  given. 


Preliminary  Segnent  Consists  of  three  separ¬ 
ate  blocks  shown  in  Fig.  2.  The  INPUT  BLOCK  stor¬ 
es  the  constraints  which  the  network  la  designed 
to  satisfy.  Here  are  the  backbone  nodes  and  their 
locations.  These  are  labelled  1 ,2 , . . .1, . . . J NH. 
The  voice  traffic  is  stored  as  a  matrix  and 

the  data  traffic  as  a  matrix  Voice  traffic 

units  are  Erlangs  and  data  traffic  units  are  bits 
per  second.  The  1-J  entries  of  each  of  these  ma¬ 
trices  constitute  the  traffic  originating  at  node 
"1"  which  Is  destined  for  node  "J".  Voice  traffic 
Is  taken  to  be  syninetrlc.  A  party  Initiating  a 
voice  call  must  automatically  provide  an  ans«rer- 
path.  An  Erlang  of  voice  traffic  originating  at 
node  "1"  with  destination  "j"  automatically  imp¬ 
lies  an  equal  flow  from  "j"  to  "i".  The  design 
algorithm  assumes  a  particular  "architecture"  for 
handling  data  traffic.  Parameters  associated  with 
this  architecture  are  provided  in  the  INPUT  BLOCK. 
Specifically,  two  different  types  of  data  messages 
are  allowed  to  be  handled  by  the  desired  circuit 
switched  network,  a  bulk  file  transfer  (type  A)  and 
a  short  interactive  message  (Type  B).  A  fixed 
fraction  of  the  data  traffic  pattern,  RHOA  is  assu¬ 
med  to  correspond  to  Type  A  traffic  and  a  fixed 
fraction,  RHOB  is  assvsnvd  to  correspond  to  Tvpe  B 
traffic.  Type  A  file  transfer  messages  are  taken 
to  consist  of  lA  bits  and  are  handled  in  the  fol¬ 


lowing  wav.  Upon  the  start  oi  a  Tvpe  A  message 
generation,  a  connecting  route  to  the  destination 
is  obtained  by  dlai-up.  It  turned  awav  redialing 
is  carried  out  everv  ten  si conds  until  i  connection 
is  obtained.  The  Tvpe  A  message  is  bufferred  and 
fed  into  the  connecting  route  at  a  rate  of  R  bits 
per  second.  The  route  ts  ht-ld  until  the  end  of  a 
single  message  transfer.  The  connection  is  then 
broken.  lA  and  R  are  listed  in  the  INPUT  BLOCK. 
Typical  values  are  10^  and  9600.  Type  B  interac¬ 
tive  traffic  is  assumed  t.i  have  the  following  cha¬ 
racteristics.  An  interactive  message  is  LB  bits 
long.  A  stream  of  interactive  messages  generated 
at  one  node  with  another  as  the  destination  have 
time  separations  of  T  seconds.  T  is  the  "think" 
time  which  depends  upon  the  speed  and  attention  of 
the  terminal  user  generating  the  messages.  Upon 
the  start  of  a  Type  B  message  generating  session 
(the  generation  of  the  first  message)  a  route  is 
dialed  up,  as  for  a  Type  A  message.  However,  aft® 
the  first  message  is  sent,  the  route  is  not  closed 
down.  Rather,  it  is  taken  to  be  held  open  until 
K-l  additional  Type  B  messages  are  sent  over  it. 
Messages  are  assumed  bufferred  and  fed  to  the  con¬ 
necting  route  at  a  rate  of  R  bits  per  second.  The 
parameters  LB,  T  and  K  are  provided  in  the  INPUT 
BLOCK.  Typical  values  are  LB*1000  -•  a  standard 
teletype  line,  T"10  sec.  and  K"200  or  1.  K“200 

corresponds  to  classical  circuit  switching  of  an 
Interactive  data  stream.  Here,  after  dial-up  a 
terminal  user  holds  a  connecting  route  to  a  compu¬ 
ter  for  an  entire  computing  session.  K"l  corres¬ 
ponds  to  "fast  circuit  switching."  Here,  dial-up 
and  routing  are  carried  out  for  each  interactive 
message. 
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Fig.  2  Details  of  Preliminary  Segment 

The  PERFORMANCE  GOAL  BLOCK  stores  the  end-to- 
end  performance  desired  for  the  circuit  switched 
network  being  designed.  This  is  the  average  jnd- 
to-end  loss  probability,  Pj^  ,  averaged  over  ail 
node  pairs  and  weighted  bv  traffic.  It  also  inc¬ 
ludes  the  average  cross  network  delav  desired  for 
Type  A  and  Type  B  messages  --  agair  averaged  over 
all  node  pairs  but  weighted  onlv  by  data  traffic. 

Before  the  actual  network  design  proiedure  c.sn 
begin  some  processing  of  the  parameters  stored  in 
the  INPUT  BLOCK  must  be  carried  out.  This  is  done 
in  the  CON’/ERSION  BLOCK.  Spec  i  flea  1  Iv,  the  inter- 
node  distances  are  computed  using  the  node  loca¬ 
tions.  The  inter-node  distances  are  stored  ip  the 
matrix  [^L^  j}  .  The  i-J  entry  of  this  matrix  is  s 

measure  of  the  cost  of  s  link  directly  connecting 
node  "i"  to  node  "J".  The  network  design  procedne 
operates  with  a  combined  traffic  matrix  of  voice 
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and  data  traffic.  The  combined  entrv  has  units  of 
Erlanii!'..  Hence,  the  data  traffic  li.is  t.'  he  convtT^ 
ted  to  equivalent  Erlang  values  and  addi- 1  t.'  the 
voice  traffic.  This  is  carried  out  in  this  block, 
by  ctnputing  the  average  nunber  of  data  calls  per 
second  and  the  call  holding  tine  for  the  .issined 
data  traffic  characteristics  and  handling  strate*^ 
The  following  formula  Is  used  to  compute  EO.^,  the 
Erlang  value  of  the  data  traffic  originating  at 
node  "I"  with  node  "j"  as  destination. 

ED^j  -  j(RHOA)(^)\+((RHOB)(D.  j)[i  ♦•(I-OT'JL 
^  R  ^  C  R  K  LB  y 

with  this,  the  Total  Traffic  Matrix 
J_EDj^^  Is  computed. 

Topological  Design  Segment  is  the  heart  of 
the  network  design  algorithm.  Optimization  of  the 
network  is  carried  out  In  this  segment.  Details 
of  the  Topological  Design  Segment  are  illustrated 
In  Fig.  3.  Certain  parameters  must  be  defined  be¬ 
fore  the  optimization  procedure  of  this  segment 
can  conmence.  This  definition  takes  place  In  the 
INITIALIZATION  BLOCK.  Specifically,  the  value  of 
a  parameter  "E"  la  set  equal  to  I  P,  .  This  cor- 

/Tsr 

responds  to  the  link  blocking  probability  "guess¬ 
timate"  required  to  achieve  the  desired  end-to-end 
average  loss  probability.  was  given  in  Che 
PKELMNASY  SEGMENT.  This  "formula"  for  E  is  de¬ 
rived  from  a  "union"  .bound  assuming  that  on  the 
average  there  are  'Tnn  links  In  each  end-to-end 
route.  A  parameter  "TEST"  is  set  initially  equal 
to  00  •  This  paramtcr  Is  used  In  the  test  for  con¬ 
vergence  of  the  optimization  procedure.  Finally, 
a  matrix  defined.  This  Is  a  NN  x  NN  ma¬ 

trix.  It  Is'^referred  to  as  the  "weight  matrix." 
Initially,  It  Is  defined  as  having  all  entries 
equal  to  1. 


Fig.  3  Details  of  Topological  Design  Segment 

The  TOPOLOGICAL  DESIGN  SE01ENT  operates  by 
connecting  nodes  with  "short"  high  usage  routes. 
With  any  "rational"  cost  measure  this  should  be  ec¬ 
onomic  network.  With  parameters  defined  in  Che 
INITIALIZATION  BLOCK,  the  CONNECTIVITY  BLOCK  is 
entered.  Here,  nudes  are  connected  bv  routes  us¬ 
ing  the  following  procedure.  The  CONNECTIVITY 
BLOCK  computes  a  matrix  Cl^ngthjj'3  .  The  1-J  ccni- 
ponent  of  this  matrix  Is  given  by  the  formula 
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Length .  ”  W  L .  , 

i]  IJ  ij 

Lengthy  j  i.i  a  weigbled  distance  botuceo 
and  node  "J".  With  the  matrix  Ij.engtb.  ^ 

the  CONNECTIVITY  BLOCK  considers  each  Succissive 
node  pair!  -7,  1^7.  It  ii.ses  a  "shortest  o.itb" 
algorithm  to  connect  the  nodes  in  each  pair.  Eacl 
ordered  node  pair  "X  ,7  is  connected  bv  a  route  ot 
directed  links  froml-to-J.  However,  lengths  in 
the  shortest  path  algorithm  arc  taken  relative  t 
the  weighted  distances  given  by  matrix  ^Length,  j]. 

In  the  first  iteration  the  network  put  nut  by  the 
CONNECTIVITY  BLOCK  will  be  fully  cunnected. 

With  topological  connectivity  defined  the 
TRUNK  ENGINEERING  BLOCK  is  entered.  Each  link  of 
the  connected  network  Is  considered.  All  of  the 
end-to-end  shortest  oath  routes,  determined  in  the 
CONNECTIVITY  BLOCK,  utilizing  a  given  link  are  er- 
umerated.  The  end-to-end  traffic  requirement  tor 
each  route  Is  obtained  frtxn  the  matrix  Tt  ”1.  For 

tj 

a  given  link  between  nodes  "l"  and  "J",  the  sum 
end-to-end  traffic.  A.,,  of  all  "weighCed"^orcest 
path  routes  traversing'^this  link  is  obtained.  If 
circuit  switched  "calls"  are  routed  on  these  patb^, 
Aji  upperbounds  the  traffic  that  will  be  offered  to 
directed  link  (l,J).  The  TRUNK  ENGINEERING  BLOCK 
solves  the  equation  E  •  E(Sj,,  A.,)  for 
quantizes  It.  E(  ,  )  is  the^blocrl 
Ity  given  by  the  Erlang  B  formula.  S. .  provides 
dimension  of  directed  link  (1,J)  in  trunks. 

The  COST  METRIC  BLOCK  computes  a  measure  of 
the  cost  of  the  connected  and  trunk  dimensioned 
network  which  has  been  produced  by  the  t--  '  previous 
blocks.  COST  METRIC  •  Th'  s*  is 

over  all  ordered  node  pairs  1,J  which  si  e  adjacent 
In  the  network.  The  value  of  COST  METRIC  is  com¬ 
pared  to  the  value  of  the  parameter  TEST.  If  it  is 
greater  than  or  equal  to  TEST,  the  TOPOLOGICAL  DE¬ 
SIGN  SEGMENT  is  exited.  If  COST  METRIC  is  less 
than  TEST  the  optimization  procedure  continues. 
Since  TEST  -00  I  nltially,  the  procedure  continues 
for  at  least  two  iterations.  When  it  continues. 
TEST  Is  set  -  COST  METRIC  and  a  new  weight  matrix 
la  computed  from  the  following  formula, 

W.  .  -  ■&  E  (S.A)  E  (S.A)^ 


1  and 
pfobabil • 


ij 


A  /  I-  "b  S 

The  denominator  In  this  formula  is  the  rate  of 
change  of  E(S,A)  over  the  Interval  fs,  S+l],  1. 
E(S+1,  A)  -  E(S,A).  This  can  be  reduced  to 


I 


W^,  -fE(S-l,A)  -  EtS.A)  kC  . _ 

^  d  E(S,A)  -  E(S+l,A)  3  UfA.'S)  E(S-1,A»1 
The  above  two  formulas  are  evaluated  at  S-S 


1 


A-Ajj.  The  TOPOLOGICAL  DESIGN  SEiIMF.NT  then 

the  next 


1  i 


goes 

iteration.  The  CONNECTIVITY  BLOCK  now 
computes  the  matrix  [Length^j^  using  the  new  woiijr 
matrix.  This  iterative  procedure  continues  until  a 
COST  METRIC  TEST  is  obtained.  Ey  studying  the 
formula  for  Wj  .  one  may  note  that  low  values  arc 
obtained  when  dealing  with  a  link  which  in  resnoose 
to  an  incremental  Increase  in  offered  flow  can  ach¬ 
ieve  the  deslrgd  blocking  probabilitv,  F. ,  wirb  j - 
few  additional  trunks  as  pi'ssibl.’.  The  TOPOt.OG TCAI. 
DESIGN  SEGMENT  achieves  a  connectivity  oi  the  rulv 
by  employing  short,  highly  utilized  directed  links 
This  provides  a  cost -e f f Ic lent  circuit  switched 
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network.  Note  that  the  u»e  of  the  weight  iratrtx 
and  the  specific  formula  for  the  weight  are  simi¬ 
lar  to  the  I  onvex  branch  e  I  Iminat  loi'  methi'd  used 
to  design  packet  switched  networks. [ij 

Routing  Segment  In  circuit  switched  networks 
when  a  ciximun  ic  a  t  or  Jt  ”i"  wi.slus  to  "talk"  to  .>ne 
at  node  "j",  a  connection  is  first  attempted  oy 
reserving  trunks  on  a  path  termed  the  "primary 
route."  If  due  to  blocking  the  connection  cannot 
be  completed  an  attempt  is  made  at  reserving 
Crunks  along  an  alternate  route.  The  ROUTING  SEG¬ 
MENT  of  the  algorithm,  shown  in  Fig.  /* ,  provides 
both  primary  and  alternate  routing  doctrines  for 
the  network  derived  in  the  TOPOLOGICAL  DESIGN  SEG¬ 
MENT.  The  primary  route  from  node  "I"  to  node  "J" 
Is  taken  as  the  shortest  directed  psth  between 
these  two  nodes.  "Shortne;s"  here  is  taken  to 
mean  length  relative  to  [,Longth|  lO,  as  computed  in 
the  last  iteration  of  the  TOPOLOGICAL  DESIGN  SEG¬ 
MENT.  The  alternate  routing  doctrine  defined  Is 
a  form  of  "progressive  routing."  A  detailed  des¬ 
cription  (xf  It  Is  given  In  Ref.  1.43.  Basically, 
at  each  of  the  Interior  nodes  of  the  primary  route 
connecting  "1"  to  "j",  there  la  a  table  which 
lists  routes  to  node  "j".  These  routes  diverge 
from  the  primary  route  at  the  Interior  node.  The 
routea  In  each  table  are  ordered  In  a  hierarchical 
manner.  In  attempting  to  reserve  the  primary 
route,  If  blocking  Is  encountered  at  interior  node 
"k",  the  routing  table  residing  at  this  node  is 
consulted.  An  attempt  at  completing  the  route  is 
then  mede  elong  the  first  entry  In  the  table.  If 
there  Is  blocking  along  this  route  the  second  is 
tried,  etc.  If  there  Is  blocking  along  all  of 
these  routes  the  "caller"  Is  turned  away  from  the 
network.  The  primary  route  and  possible  alternate 
routes  In  one  situation  are  tlluatrated  In  Fig.  5. 
In  the  version  of  the  algorithm  which  haa  been  im- 
pleaiented,  diverging  alternate  routes  are  taken  to 
be  the  second  shorteat  route,  the  third  shortest 
route,  etc.  Shortness  is  taken  according  to  the 
last  values  of  [Length^j*^. 


SOUTISS  SCCfCNT 
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FIG,  4  DETAILS  OF  ROOTING  SEGMENT 


Network  Sped  f  lest  ton  In  this  module  of  the 
algor ItlRii  the  network  design  Is  scored;  chat  Is, 
the  topology,  link  trunk  dimensions  and  routing 
doctrine  are  sunmerlzed. 


Voice  Performance  Analysts  Segment  and  1ns- 
ructed  Adjustments  The  goal  of  the  algorithm  is 


Fig.  5  The  Primary  Route  From  Node  ”i"  to 
Node  "J"  with  Diverging  Alternate 
Routes 


to  design  a  cost-efficient  network  satisfying  a 
constraint  on  P  .  The  TOPOLOGICAL  DESIGN'  SEGMENT 
attempts  to  meef  this  constraint  bv  sizing  the 
trunk  dimensions  of  each  link  to  correspond  to  the 
trunk  engineering  by  utilizing  the  Erlang  B  formu¬ 
la.  However,  the  data  traffic  and  alternate  rout¬ 
ed  traffic  nay  not  have  the  Poisson  arrival  statis¬ 
tics  assigned  by  this  formula.  As  a  result,  the 
network  initially  stored  in  the  NETWORK  SPFCIFtCA- 
TION  BLOCK  may  not  meet  the  desired  P^.  The  'ifOICE 
PERFORMANCE  ANALYSIS  SEGMENT  and  INSTRUCTED  ADJUST¬ 
MENT  BLOCK  correct  this  deficiency.  The  VOICE  PER¬ 
FORMANCE  ANALYSTS  SEGMENT  analyzes  the  network 
stored  in  the  NETWORK  SPECIFICATION  BLOCK  and  ob¬ 
tains  an  estimate,  called  Pj^,  of  P^.  Details  of 
the  analyjjis  procedure  are  given  in  Ref.  C4l-  The 
estimate  P^^  Is  compared  to  the  desired  value  of  P|  . 
If  it  is  within  Zi  ,  (typically  «10"^),  the  net¬ 
work  stored  In  the  NETWORK  SPECIFICATION  BLOCK  is 
Accepted  as  the  design  and  the  algorithm  proceeds 
to  the  DATA  PERFORMANCE  ANALYSIS  SEGMENT.  If  P^^ 

Is  not  within  A  the  algorithm  proceeds  to  the  IN¬ 
STRUCTED  ADJUSTMENTS  BLOCK,  which  computes  a  new 
value  of  link  blocking  probability  form  the  previ¬ 
ous  one  for  trunk  engineering.  The  new  value, 
E(ncv),  Is  obtained  from  the  following  formula. 
E(new}  -  E  i-fl/JUS')  (Pj,  -  '?l)  •  The  INSTRUCTED  AD¬ 
JUSTMENTS  BLOCK  then  takes  the  network  topology 
stored  In  the  NETWORK  SPECIFICATION  BLOCK.  It  car¬ 
ries  out  the  trunk  engineering  of  each  link  tc  sat¬ 
isfy  the  blocking  probability,  E(newi.  A  new  net¬ 
work  la  then  produced;  the  routing  doctrines  stay 
the  same.  This  new  network  replaces  the  previous 
one  stored  In  the  NETWORK  SPECIFICATION  BLOCK.  Its 
loss  probability  Is  analyzed  In  the  VOICE  PERFOR¬ 
MANCE  ANALYSIS  SE01ENT.  The  procedure  continues  as 


described  above  until  the  trunk  engineering  produ 
CCS  a  within  A  of  the  desired  Pj  . 


Data  Performance  Analysts  Scs 


This  block 


computes  D^^  and  Dg,  the  average  cross-network  de¬ 
lay  for  Type  A  and  Type  B  data  messages,  respect¬ 
ively.  The  average  is  taken,  among  other  things, 
with  respect  to  all  node  pair  data  traffic  require¬ 
ments.  These  quantities  are  computed  by  obtaining 
Djj  for  each  traffic  type.  This  Is  the  average 
croas-netwerk  delay  for  a  data  message  (either 
Type  A  or  B)  uliich  has  its  source  at  node  "1"  ard 
destination  at  node  "J".  Is  computed  by  the 

algorithm  using,  ®lj  ”  ^L  ^L 
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the  average  tine  taken  In  waiting  to  redial  until 
a  route  la  finally  granted.  It  has  been  assianed 
that  If  a  data  communicator  Is  turned  away  he  re¬ 
dials  after  10  seconds.  Hence,  C^*10 
Cp  Is  Che  signal  propagation  time,  which  Is  deter¬ 
mined  by  the  actual  length  of  the  l-to-J  primary 
route.  Most  traffic  will  go  along  this  route. 

Is  the  transmission  time  for  a  message,  given  by 
Che  formula  C^mnessage  length/R.  Message  length 
is  or  L.  as  specified  In  the  PRELIMINARV  SEG¬ 
MENT.  Is  Che  average  time  taken  to  set-up  the 
route  connecting  node  "1"  to  "j"  or  to  turn  the 
communicator  away  because  of  Inability  to  reserve 
a  route.  Let  this  actual  route  set-up  time  be  de¬ 
signated  as  Tf.  For  the  Type  A  file  transfer  mes¬ 
sage,  or  for  a  Type  B  Interactive  traffic  with 
classical  circuit  switching,  the  entire  set-up  de¬ 
lay  Is  borne  by  the  first  message.  The  remaining 
K'l  messages  have  zero  set-up  time.  In  this  case 
the  average  •  T^/K.  The  value  of  T^for  C^)  de¬ 

pends  upon  the  type  of  signalling  employed  by  the 
circuit  switched  network  In  order  Co  set-up  a 
route  —  Che  methods  used  to  transmit  routing  In¬ 
formation  through  Che  network  In  reserving  crunks. 
The  design  algorithm  aasianes  that  Coninon  Channel 
Interswltch  Signalling  Is  employed.  Separate  di¬ 
gital  links  are  assumed  to  connect  Che  nodes  of 
Che  network  and  are  reserved  strictly  for  signal¬ 
ling  information.  Routes  are  reserved  by  sending 
small  addressing  packets  on  these  signalling  Uefes. 
Transmission  occurs  at  the  race  of  R,  bits  per  se¬ 
conds.  Typically,  Rj  ■  9600.  A  circuit  switch 
model  Is  Incorporated  into  this  block  of  Che  algo- 
rltlmi  In  order  to  sccount  for  delays  In  Che  switch 
These  delays  correspond  to  queuing  delays  In  hav¬ 
ing  Che  switch  deal  with  the  routing  requests  and 
finding  a  path  through  Che  switch. 

Costlna  Seament  In  this  block  the  transmis¬ 
sion  cost  and  Che  switch  cose  of  the  circuit  swit¬ 
ched  network  design  are  computed .  Transmission 
cost  of  the  network  Is  taken  to  be  the  sum  of  the 
seperace  transmission  costs  of  each  of  the  links 
In  the  network.  Switch  cost  Is  taken  to  be  the 
SUB  of  fhe  costs  of  the  Individual  switches.  Be¬ 
cause  of  Che  modularity  of  this  block  It  can  be 
progranned  to  model  any  tariff  and  circuit  switch 
cost.  For  example.  In  our  experiments  ve  have  us¬ 
ed  the  DOS  line  rates. 

Outputs  In  this  block  the  final  circuit 
switch  network  design  with  performance  and  cost  Is 
scored. 


a  DEC  20/50  with  the  TOPS  20  operating  syarem. 
The  memory  required  was  close  to  b5,Ono  ( J6  bit) 
words . 


Fig.  6a  Pl  •  .002,  Cost  •  $2,476,025  per  month 


Fig.  6c  Pj^  •  .1,  Cost  ■  $1,B32,319  per  month 
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EXAMPLE  NETWORK  DESIGNS 

The  topologies  of  typical  network  designs  ob¬ 
tained  with  the  algorithm  arc  shown  In  Figs.  6a, 
b,  &  c.  These  are  eight  node  networks  spanning 
the  continental  United  States.  They  were  designed 
for  total  voice  traffic  •  5400  Erlangs,  total  data 
traffic  -  361  kbps,  RHOA  •  RHOB  •  0.5,  K  •  200 
(classical  circuit  switching),  LB  •  1000,  T  •  10. 
^e  designs  were  carried  out  for  different  values 
of  Pj^.  Costs  arc  noted.  These  designs  were  ob¬ 
tained  by  Implementing  the  algorithm  In  Fortran 
The  running  time  waa  approximately  five  seconds  on 
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ABSTRACT  The  problem  of  determining  the  optimal  number  of  WATS  lines 
given  an  hourly  profile  of  outbound  calls  from  a  given  location  is  considered  It  is 
shown  that  the  relationship  between  cost  and  earned  load  under  the  new  WATS 
tariff  with  a  very  small  fixed  charge  for  lines  is  relatively  smooth  and  unimodal  It  is 
thus  possible  to  obtain  globally  optimal  solutions  to  most  problems  by  using  a  very 
efficient  local  optimization  procedure  which  only  considers  addition  and  deletion  of 
single  lines  and  the  exchange  of  one  line  for  another  Computational  experience 
with  this  procedure  and  a  comparison  with  exhaustive  search  arc  presented. 


1.  Introduction 

Wide  Area  Telephone  Service  (WATS)  is  a 
service  offered  by  telephone  companies  to 
provide  volume  discounts  to  larger  users.  Un¬ 
til  recently,  WATS  offerings  included  both  full 
and  measured  service  With  full  service 
WATS,  the  user  was  permitted  essentially  un¬ 
limited  calling  for  a  fixed  monthly  fee.  Meas¬ 
ured  service  provided  for  a  lower  fixed  fee.  10 
hours  of  free  service  and  an  hourly  fee  for 
usage  in  excess  of  10  hours  per  month.  The 
new  WATS  tariff  eliminates  full  period  WATS 
service  and  offers  a  somewhat  modified  form 
of  measured  service 

There  are  actually  many  WATS  offerings 
for  both  inward  and  outward  callers  to  and 
from  a  given  location  on  either  an  intrastate  or 
interstate  basis  We  will  focus  here  on  the  in¬ 
terstate  out-WATS  service  offered  by  AT&T 

■  This  work  was  suppoilod  in  pan  by  a  gram  from  Con- 
Tel  Information  Syslyms 


Long  Lines  within  the  continental  U.S.  In  this 
case  the  continental  U.S  is  divided  up  into  5 
bands.  Band  1  consists  of  area  codes  close  to 
the  caller  but  outside  his  state  Band  2  con¬ 
sists  of  these  area  codes  plus  others  some¬ 
what  farther  away.  This  continues  until  Band 
6.  which  includes  all  area  codes  in  the  U.S  . 
including  Alaska,  Hawaii,  Puerto  Rico  and 
the  Virgin  Islands.  In  this  paper,  we  will  deal 
primarily  with  the  5  WATS  bands  which  cover 
the  continental  U.S  outside  the  caller's  state. 
Thus,  each  WATS  band  includes  all  the  lower 
numbered  bands  and  represents  a  roughly  fixed 
percentage  of  the  telephones  in  the  U  S  re¬ 
gardless  of  where  the  caller  is  The  specific 
area  codes  involved  are  a  function  of  which 
state  the  caller  is  in.  The  service  applies  only 
to  interstate  calls.  Most  states,  if  not  all.  have 
intrastate  WATS  service  but  may  have  dif¬ 
ferent  characteristics  and  are  not  the  subject 
of  this  paper.  The  service  applies  only  to  out 
ward  calls  from  a  single  location  The  in- 
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terestcd  reader  is  referred  to  ll]  for  all  the 
details  of  this  tariff  Its  saliant  numerical  pa¬ 
rameters  are  described  in  the  following  sec¬ 
tions 

In  order  to  use  WATS  service,  the  user  or¬ 
ders  one  or  more  lines,  which  we  refer  to  as  a 
group,  in  one  or  more  of  the  6  bands.  A  line 
in  a  given  band  may  be  used  to  place  an  inter¬ 
state  call  to  any  area  code  in  that  band  Note 
that  if  the  user  has  both  Band  1  and  Band  2 
groups,  for  example,  a  call  to  an  area  code  in 
Band  1  may  use  a  line  in  either  group.  In  addi¬ 
tion  to  ordering  lines,  users  must  decide  on 
how  to  route  calls.  That  is,  they  must  decide 
whether  to  allow  an  outgoing  call  to  use  a 
higher  numbered  band  if  no  lines  in  its  band 
are  available,  or  overflow  the  call  to  DDD 
(Direct  Distance  Dialing;  i.e.,  the  call  is  placed 
as  an  ordinary  long  distance  call),  or  block  the 
call  entirely. 

2,  The  New  WATS  Tariff 

The  new  tariff,  as  it  is  used  in  this  paper,  is 
described  in  detail  on  the  8th  revised  page 
14.1  of  AT&T  Tariff  259  (1).  The  significant 
change  from  the  old  tariff  is  that  full  period 
lines  are  eliminated  and  only  measured  lines 
remain  Specifically,  the  cost  of  a  WATS  line 
carrying  hours  of  traffic  in  the  daytime,  H, 
hours  of  traffic  in  the  evening,  and  H„  hours 
of  traffic  at  night  costs: 

C  -  27.50  +  Crf(Hrf)  +  C.(H,)  +  CJH„) 

where 

is  the  cost  for  hours  of  daytime 
tiaffic,  computed  from  a  piecewise  linear  func¬ 
tion.  i.e.  so  much  per  hour  for  the  first  15 
hours,  so  much  per  hour  for  the  next  25 
hours,  etc.  For  example,  for  a  Band  5  line  in 
New  York  City, 

Cd  -  19.91  X  (min(Hd,  15)1 

+  17.71  X  (min  -  15,25)1* 
+  15.53  X  Imin(Hd  -  40.40))* 
13.13  X  |Hd  -  801* 


C,  IS  compulL'd  in  a  similar  manner  C„  is 
simply  b  92  x  H,  The  constants  used  to 
compute  Cp  are  exactly  b.S%  of  those  to  com 
pute  Cd  This,  presumably,  reflects  the  fact 
that  DDD  costs  in  the  evening  are  65%  of  the 
day  rates.  The  night  time  hourly  cost  is.  ap 
proximately  40%  of  an  average  day  rate, 
again  paralleling  DDD  costs  Table  1  gives  the 
WATS  and  DDD  costs  (estimates)  for  New 
York  City  These  are  used  in  the  computa¬ 
tional  experiments  and  examples  presented  in 
the  rest  of  this  paper. 


Table  1A.  WATS  Line  Daytime  Cost  (Hourly) 


Band 

1st  IS 
Hours 

Next  15 
Hours 

Next  40 
Hours 

All  Other 
Hours 

1 

15.13 

13.46 

11.79 

9.98 

2 

17.32 

15.42 

13.52 

11.43 

3 

18.14 

16.15 

14.15 

11.97 

4 

18.89 

16.81 

14.74 

12.47 

5 

19.91 

17.71 

15.53 

13.13 

Table  IB.  WATS  Line  Evening  Cost  (Houriy) 


Band 

1st  IS 
Hours 

Next  IS 
Hours 

Next  40 
Hours 

All  Other 
Hours 

1 

9.83 

8,75 

7.66 

6.49 

2 

11.26 

10.02 

8.79 

7.43 

3 

11,79 

10,50 

9.20 

7.78 

4 

12,29 

10.93 

9.58 

8.11 

5 

12.94 

11.51 

10,09 

8.53 

Table  1C.  WATS  Line  Nighttime  Cost  (Hourly) 


Band 

All  Hours 

1 

5.26 

2 

6.02 

3 

6.31 

4 

6.56 

5 

6.92 

Table  10.  Daytime  Average  Cost 

From  New  Yoik  City 

Band 

ODD  Cost  (Hourly) 

1 

19.96 

2 

22.86 

3 

2394 

4 

24  94 

5 

26  26 

where  (X|  *  is  X  if  X  a  0  and  0  otherwise 
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Thus,  wc  sec  that  WATS  and  DDD  rusu 
vary  in  similar  ways.  It  will  be  useful  for  us  to 
consider  a  model  of  how  cost  varies  with  band 
as  it  will  allow  us  to  make  a  quantitative  trade¬ 
off  between  traffic  in  different  bands  and  in 
particular  to  approach  the  problem  of  decid¬ 
ing  on  the  size  of  each  WATS  group  Table  2 
gives  the  relative  costs  in  each  of  the  bands  as 
seen  from  New  York  City.  These  factors  ap¬ 
ply  to  all  hourly  costs;  e  g.,  DDD  costs,  the 
first  15  hours  of  daytime  WATS  traffic,  etc 
The  factors  arc  exact  for  WATS  costs  and  rea¬ 
sonable  approximations  for  DDD  costs.  These 
factors  are  different  for  different  states,  but 
they  exist  for  all  states  and  are  easily  deter¬ 
mined  from  the  tariff. 

Indeed,  the  entire  WATS  tariff  may  be 
modelled  as  a  bulk  discount  on  DDD.  Given  a 
vector,  Dt,,  the  average  per  hour  DDD  cost  in 
Band  b,  the  constants  used  to  compute 
C«,  and  C„  can  be  thought  of  as  percentages 
of  Dfc.  For  example,  if  the  average  hourly  day¬ 
time  DDD  cost  for  Band  5  calls  from  NY  City 
were  $26.26  during  the  day,  we  could  write 
Cj  as 

Cj  -  26.26  X  {.758  x  min  (H^.  15) 

+  .674  X  (min(Hd  -  15,  25)1* 
+  .591  X  [min(Hd  -  40.  40) r 
+  .5  X  (Hd  -  80)}. 

That  is,  the  WATS  tariff  offers  a  discount  of 
between  25%  and  50%  relative  to  DDD.  The 
discount  is.  of  course,  relative  to  the  26.26 
figure.  If  this  figure  were  lower,  the  discount 
is,  relatively,  smaller.  Based  upon  an  assumed 
4  minute  holding  time,  this  rate  appears  to  be 
a  reetsonable  estimate.  Shorter  holding  times 
would  give,  comparatively,  higher  discounts. 


Table  2.  Relative  Costs 


Bartd 

Factor 

1 

1.000 

2 

1.145 

3 

1.199 

4 

1.249 

5 

1.316 

The  significant  thing  is  that  the  same  model 
with  the  same  discounts  applies  to  all  bands, 
all  states,  and  all  times  of  day. 

The  usage  of  a  WATS  line  in  a  band  is  de¬ 
fined  to  be  the  average  usage  over  the  group 
Thus,  if  there  are  M  lines  in  a  Band  b  WATS 
group  and  together  they  carry  H  hours  of  day¬ 
time  traffic  during  a  month,  then  =  H/M. 
A  larger  group  will  block  less  traffic  and, 
hence  carry  more  traffic.  Thus,  if  we  add  a 
line  to  the  group  both  H  and  M  will  increase. 
Since  the  efficiency  of  lines  within  a  group 
decreases  as  the  number  of  lines  increases, 
however.  H  will  increase  more  slowly  than  M 
and  will  decrease.  The  cost/hour  for  traffic 
carried  by  the  group  will  therefore  increase  as 
the  size  of  the  group  increases.  It  will  not. 
however,  exceed  the  DDD  cost  until  the 
$27.50  fixed  charge  becomes  a  significant 
portion  of  the  total  cost  of  the  line. 

The  choice  of  how  to  optimally  route  calls  is 
itself  an  interesting  problem  involving  trade¬ 
offs  between  cost  and  quality  of  service  In 
practice,  a  user's  routing  choices  are  limited 
by  the  capabilities  of  his  local  equipment 
(PABX)  and  may  include  overflow,  queueing 
for  lines  in  some  bands,  blocking,  and  even 
time-of-day  sensitive  routing.  This  is,  however, 
beyond  the  scope  of  this  paper.  The  simple 
assumption  is  made  that  a  call  is  offered  to  the 
lowest  numbered  band  capable  of  handling  it 
and  if  no  lines  are  available  in  this  band  then 
the  call  overflows  to  each  higher  numbered 
band  and  finally  to  DDD,  if  necessary. 

Thus,  the  focus  is  on  the  problem  of  deter¬ 
mining  the  optimal  number  of  i  les  in  each 
WATS  band  group.  The  larger  the  number  of 
lines  in  a  group,  the  more  tratiic  il’e  group 
carries  and  hence,  the  less  traffic  overflows  *o 
higher  numbered  groups  and  to  DDD.  The 
base  cost  of  a  line  is  very  small,  only  $27  50 
per  month.  At  first  glance  one  might  expect 
this  would  lead  to  solutions  with  many  lines 
However,  as  we  will  see  in  the  following  sec¬ 
tion,  the  tariff  provides  that  the  cost  per  hour 
decreases  as  the  number  of  hours  of  usage  in¬ 
creases  and  that  the  number  of  hours  of  usage 
is  averaged  over  the  lines  in  a  band  Thus,  add  - 
ing  a  line  to  a  group  in  a  band  will  decrease 
the  number  of  hours  of  usage  per  line  and 
hence,  the  average  cost  per  hour  of  carried 
calls  increases  This  limits  tfic  number  of  lines 
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desired  in  a  band  and  makes  the  problem  in¬ 
teresting 

3.  Optimization  Procedure 

The  optimization  procedure  is  imple¬ 
mented  in  FORTRAN  and  runs  on  a  DEC 
20-60.  The  software  package  includes  an  in¬ 
teractive  front-end  which  permits  the  user  to 
edit  a  WATS  configuration,  traffic,  tariff,  and 
parameter  values.  Finally.  It  includes  a  full 
spectrum  of  design  and  analysis  modules 
which  allow  him  to  design  configurations  us¬ 
ing  a  variety  of  options  for  queueing,  blocking 
and  overflowing  traffic.  For  the  sake  of  clarity 
of  presentation,  we  will  concentrate  on  the 
overflow  version  of  the  problem  in  the  re¬ 
mainder  of  this  paper.  Thus,  we  will  assume 
that  traffic  offered  to  a  group  overflows  to 
each  of  the  higher  numbered  groups  and 
eventually  to  ODD,  if  necessary. 

The  procedure  takes  as  input  a  description 
of  the  tariff  as  given  above  and  a  description 
of  the  traffic  to  be  carried  by  the  network.  This 
traffic  description  may  take  one  of  several 
forms  depending  on  what  information  is  avail¬ 
able.  Ideally,  the  user  will  process  billing  infor¬ 
mation  supplied  by  the  local  telephone  com¬ 
pany  and  get  at  least  one  month's  data  giving 
the  actual  number  of  minutes,  calls  and  the 
actual  ODD  (Direct  Distance  Dial)  cost  over 
the  given  time  period.  Alternatively  estimat¬ 
ing  procedures  may  be  used  if  this  informa¬ 
tion  is  not  available.  In  either  case,  the  user 
develops  a  traffic  matrix  which  gives  the  num¬ 
ber  of  erlangs  of  traffic’  to  each  band.  DDD 
cost  per  minute  in  each  band,  which  is  also  an 
input  to  the  program,  may  also  be  developed 
as  an  average  derived  from  actual  call  records 
or  alternatively  may  be  derived  from  an  aver¬ 
age  figure  for  the  band.  Usually,  this  will  not 
be  a  critical  issue  as  DDD  costs  do  not  differ 
widely  from  one  part  of  a  band  to  another 

If.  however,  a  large  amount  of  traffic  is 
known  to  go  to  a  location  near  the  edge  of  the 
band,  then  the  DDD  cost  of  this  traffic  will  not 
be  typical  lor  the  band  and  should  be  com¬ 
puted  using  the  actual  cost  of  calling  that  loca¬ 
tion  Also,  if  the  length  of  all  calls,  or  calls  to 


■  An  vrlang  is  a  miMSiiie  o(  liatfu'  inlenMty  One  erlang 
IS  eiiiiivaleni  i<*  nil  niinuu-s  nf  call  holding  lime  per  hour 
(i  e  sutlicienl  iratiii  'o  iipy  a  single  lin«'  tull  hrne) 


certain  b..nds  is  known  to  lie  unusually  short, 
then  the  one-minute  minimum  charge  which 
is  applicable  for  most  long-distance  calls  will 
significantly  increase  the  DDD  cost.  In  this 
case,  one  should  obtain  the  actual  number  of 
calls  so  that  a  more  accurate  estimate  of  the 
true  DDD  cost  can  be  computed. 

The  major  outputs  from  the  procedure  are 
the  number  of  lines  in  each  group  and  the 
total  WATS  and  DDD  cost.  Auxiliary  reports 
are  also  produced.  Including  the  cost  per 
band,  the  traffic  offered  to  and  carried  by 
each  band,  details  on  an  hour  by  hour  basis 
and  summaries  of  activity  at  major  locations 
(when  input  supporting  this  is  available) .  This 
is  useful  as  part  of  more  global  optimization 
procedures  which  determine  the  placement  of 
tie  lines,  FX  (foreign  exchange)  lines,  and 
tandem  switches. 

Figure  1  is  a  flowchart  of  the  global  optimi¬ 
zation  procedure.  It  begins  by  initializing  the 
configuration  to  an  all  band  5  solution,  i.e., 
no  lines  in  bands  1  through  4  and  a  sufficient 
number  of  lines  in  band  5  to  satisfy  a  given 
grade  of  service  during  the  peak  hour. '  As 
will  be  seen,  the  procedure  is  not  sensitive  to 
the  starting  configuration  used  In  particular, 
we  have  not  observed  any  significant  varia¬ 
tion  in  the  running  time  of  the  procedure  or 
the  quality  of  the  solution  it  produces  when 
the  starting  configuration  is  changed. 

The  procedure  is  constrained  to  produce 
solutions  with  no  fewer  than  LO,  lines  and  no 
more  than  HI,  lines  in  band  i.  The  values  of 
LO,  and  HI,,  which  are  inputs  to  the  pro¬ 
cedure  can  be  set  in  several  ways  First,  if  one 
wishes  simply  to  find  an  optimal  solution.  LO, 
can  be  set  to  0  and  HI,  can  be  set  to  a  large 
number  (e.g..  a  number  of  lines  sufficient  to 
carry  99%  of  the  load  offered  to  band  i  and  all 
lower  bands).  If  the  expansion  of  an  already 
existing  network  or  a  local  modification  of  a 
larger  network  (containing  private  tandem 
switches.  FX  lines,  etc  )  is  being  considered, 
LO,  and  HI,  may  be  set  to  values  very  close  to 
current  values  Finally,  by  setting  LO,  =  HI, 
“  U”,  a  specific  configuration  containing  I/, 
lines  per  band  can  be  evaluated. 


*  Grade  ftl  vrvict  is  didinod  as  the  fraction  of  calls 
lnt»t  r.ir'K'dI  In-  the 
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Fig.  1.  OPTSUB. 


Tlie  current  solution  being  examined  is 
maintained  In  the  array  VAL.  Thus.  VAL, 
contains  the  numbei  of  lines  in  band  i  in  the 
current  solution.  Local  perturbations  to  the 
current  solution  are  explored  by  Incrementing 
and  decrementing  individual  VAL,. 

The  procedure  EVAL  takes  VAL.  the  traf¬ 
fic  requirements,  and  the  tariff  description  as 
input.  It  then  calculates  the  total  amount  of 
traffic  carried  by  each  WATS  band  (and  by 
DDD)  and  costs  out  the  configuration,  in  the 
case  considered  here,  this  is  done  by  using 
the  Erlang-B  formula  and  allowing  traffic  to 
overflow  freely  from  each  band  through  the 
higher  numbered  bands  and  eventually  to 
ODD,  if  necessary.  In  other  versions  of  the 
procedure,  overflow  and  blocking  are  com¬ 
puted  on  the  basis  of  Rapp's  approximation  to 
Wilkinson's  Equivalent  Random  Method 
[4,5]  and  Barrer's  equations  [6]  for  estimating 
overflow  in  networks  with  queueing.  EVAL 
also  compares  the  total  cost  of  the  current 
solution  with  MINSUM,  the  lowest  cost  for 
any  configuration  thus  far  evaluated.  If  the 
cunent  cost  is  less  than  MINSUM,  it  replaces 
MINSUM  and  the  cunent  configuration  is 
retained. 

The  routines  ADD  and  ROLL  explore  local 
modifications  to  the  cunent  configuration  and 
comprise  the  heart  of  the  optimization  proce¬ 
dure.  ADO  considers  the  addition  of  one  or 
more  lines  to  band  BAND.  It  proceeds  by  add¬ 
ing  single  lines  to  the  group  in  band  BAND 
until  it  finds  the  cost  of  the  configuration  no 
longer  decreases. 

ROLL,  whose  flowchart  is  shown  in  Figure 
2,  proceeds  along  similar  lines  attempting  to 
exchange  lines  between  two  bands,  thus  effec¬ 
tively  rolling  traffic  from  one  band  to  another. 
ROLL  begins  by  removing  single  lines  from 
the  group  in  band  FROM  until  the  cost  in¬ 
creases.  It  then  attempts  to  reduce  the  cost  by 
adding  a  line  to  the  group  in  band  TO.  Thus. 
ROLL  considers  both  the  possibility  of  remov¬ 
ing  one  or  more  lines  from  band  FROM  and 
exchanging  one  or  more  lines  in  band  FROM 
for  the  same  number  of  lines  in  band  TO. 
Finally.  ROLL  considers  the  possibility  of  ex¬ 
changing  one  line  in  band  FROM  for  two  lines 
in  band  TO. 

It  should  be  noted  that  the  (unction  of  .^L)I) 
is  actually  crxitained  in  ROLL  ADD  is  used, 
however,  because  It  is  roughly  three  limes  as 


fast  as  ROLL.  ADD  and  ROLL  are  called  re¬ 
peatedly  to  improve  the  current  solution  via 
local  modifications  until  no  further  progress  is 
made.  The  best  value  prior  to  the  current  cy¬ 
cle  of  local  modification  Is  recorded  In  OLD.  If 
after  attempting  all  local  modifications,  MIN¬ 
SUM  is  still  unchanged,  the  procedure  termi¬ 
nates.  A  similar  technique  is  used  to  terminate 
procedure  ROLL. 

An  earlier  version  of  this  algorithm  only 
considered  ROLLing  between  adjacent  bands. 
The  procedure  was  not  as  much  faster  as  one 
might  have  expected  (typically,  the  difference 
was  less  than  a  factor  of  two)  and  the  results 
obtained  were  sometimes  slightly  worse. 

Note  in  Figure  1  that  band  5  is  treated 
somewhat  differently  from  the  other  bands. 
Variable  BAND  only  takes  values  between  1 
and  4  and  ADD  and  ROLL  are  called  accord¬ 
ingly.  For  each  value  of  BAND,  the  proper 
size  of  the  band  5  group  is  first  determined 
before  other  local  modifications  are  attempted. 
This  is  because  the  tradeoffs  between  other 
bands  are  generally  less  dramatic  than  trade¬ 
offs  Involving  band  5.  With  other  bands  the 
WATS  costs,  which  are  generally  similar,  are 
involved.  When  band  5  Is  involved  part  of  the 
tradeoff  is  with  DDD,  which  is  somewhat 
more  significant. 

In  both  the  flowcharts  in  Figures  1  and  2,  it 
can  be  seen  that  the  algorithm  proceeds  In  an 
orderly  fashion  to  consider  all  reasonable 
local  exchanges  (i.e.,  the  addition  of  a  line  in 
a  band,  the  deletion  of  a  line  from  a  band, 
and  the  exchange  of  a  line  in  one  band  for  a 
line  in  another) .  The  variable  HVE  in  Figure  1 
is  simply  the  constant  five.  The  variable  BAND 
takes  the  values  one  through  four  and  is  the 
current  band  being  considered  in  a  local  ex¬ 
change.  LOBAND  takes  values  between  one 
and  BAND-1  and  is  the  other  band  involved 
in  the  current  local  exchange.  At  each  stage, 
EVAL  is  called  to  evaluate  the  current  config¬ 
uration  and  update  MINSUM  if  this  configura¬ 
tion  is  the  best  yet  encountered. 

The  significant  feature  about  this  approach 
is  that  only  local  changes,  i.e.,  the  adding  or 
deleting  of  single  lines,  are  considered  and 
movement  in  a  given  direction  is  halted  when 
no  iiiilhcr  progress  is  encountered.  This 
greatly  improves  its  running  time  as  com¬ 
pared  with  a  procedure  considering  a  com¬ 
plete  spectrum  of  alternatives,  which  would 
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involve  considering  a  nuinlier  of  alternatives 
equal  to  the  product  over  i  of  (HI,  -  LO,). 

The  running  time  of  this  procedure  is  essen¬ 
tially  linear  in  the  number  of  perturbations  (at¬ 
tempts  to  move  a  line  from  one  band  to 
another)  examined.  The  time  to  evaluate  a 
perturbation  is  essentially  linear  in  the  number 
of  lines  in  the  network.  The  number  of  pertur¬ 
bations  varies  from  one  network  to  another, 
but  appears  roughly  linear  in  the  number  of 
lines.  The  procedure  ran  for  less  than  one 
minute  for  a  network  with  a  total  of  75  lines  in 
it. 

Theoretically,  in  order  to  guarantee  that 
such  a  procedure  will  converge  to  a  global  op¬ 
timum,  the  function  we  seek  to  minimize 
should  be  unimodal,  i.e.,  it  should  possess  a 
unique  minimum  which  is  reached  by  descend¬ 
ing  along  any  sequence  of  local  exchanges.  In 
an  absolute  sense,  the  function  we  seek  to 
minimize  is  not  unimodal.  However,  we  have 
found  it  to  be  nearly  so  in  practice. 

An  argument  can  be  made  for  the  validity 
of  the  procedure  based  upon  the  average  cost 
per  hour  in  each  group.  The  total  offered  bad 
is  fixed.  In  ail  cases— queueing,  blocking,  and 
overflow— the  total  carried  bad  is  also  essen- 
tiaily  fixed.  Thus,  to  minimize  total  cost,  one 
seeks  to  minimize  the  cost  per  hour  of  carried 
traffic.  The  procedure  described  above  can  be 
thought  of  as  one  which  constantly  seeks  to 
move  traffic  from  a  group  with  a  high  cost  per 
hour  to  one  with  a  bw  cost  per  hour  and 
stops  when  all  alternatives  have  roughly  the 
same  incremental  cost  per  hour. 

Such  a  procedure  would  work  perfectly, 
yielding  an  optimal  solution  every  time,  if  as 
traffic  was  moved  from  one  group  to  another, 
the  cost  per  hour  changed  in  a  consistent 
direction.  Unfortunately,  in  this  case,  it  need 
not.  As  traffic  is  moved  to  a  group,  the  group 
operates  more  efficiently  and  the  cost  per 
hour  for  traffic  carried  by  the  group  goes 
down.  This  is  good  as  it  reinforces  oui  dcci 
Sion  to  move  traffic  into  the  group  Utdortu 
nately.  as  the  traffic  offered  to  a  group  in 
creases,  so  does  the  amount  that  overflows 
from  it.  If  the  place  it  oveiflows  to  has  a  higher 
cost  per  hour,  then  it  makes  the  decision  to 
offer  traffic  less  desirable.  Furthermore,  ttie 
group  from  uhii. ii  traff'C  is  removed  will  be 
come  less  cflicienl  inil'ally  but  if  one  then  dv 
cieascs  the  y.ie  of  l!ie  group,  it  will  bi/come 


more  efficient  Thus,  there  arc  many  factors 
working  against  one  another  and  the  net 
result  is  not.  in  general,  clear.  All  this  is  com¬ 
pounded  by  the  fact  that  we  are  working  with 
different  amounts  of  traffic  In  different  hours 
and  so  the  effect  of  a  change  is  further  confused. 

If  we  examined  the  situation  more  closely, 
however,  we  will  see  that  there  is  a  cause  for 
optimism.  The  actual  procedure  does  not  di¬ 
rectly  offer  more  or  haS  traffic  to  the  group;  it 
either  increases  or  decreases  the  size  of  the 
group.  It  does  not  calculate  costs  per  hour:  it 
calculates  the  actual  cost  of  the  network. 
Also,  it  makes  the  smallest  possible  changes, 
working  with  increments  and  decrements  of  a 
single  line.  Thus,  it  makes  decisions  based 
upon  the  total  net  effect  of  a  change.  If  adding 
a  line  to  a  group  decreases  the  total  cost,  it 
will  keep  adding  lines  to  the  group  until  the 
net  effect  reverses.  The  changes  are  made 
gradually  so  that  the  net  effect  can  be  watched 
as  closely  as  possible.  The  entire  process  can 
be  viewed  as  one  of  equalizing  potentials  in 
an  electriai  network  or  as  rolling  water  among 
partitions  within  a  container.  The  problem  is 
that  whenever  you  roll  some  water,  some¬ 
body  comes  along  and  alters  the  height  of  the 
partitions. 

If  the  costs  per  hour  in  two  adjacent  groups 
are  nearly  equal,  there  is  little  to  be  gained 
from  roiling  traffic  between  them.  Once  we 
get  to  this  point,  we  can  only  make  small  mis¬ 
takes.  It  is  relatively  easy  to  get  to  this  point 
with  the  WATS  groups  since  their  costs  per 
hour  were  close  to  begin  with.  The  situation 
with  respect  to  ODD  is  different.  The  whole 
point  of  using  WATS  is  that  it  is  significantly 
cheaper  than  DDD  The  procedure  takes  this 
into  account  by  only  rolling  traffic  between 
groups  which  do  not  overflow  directly  to  DDD 
and  reoptimizing  the  last  group,  which  does, 
directly.  Thus,  for  example,  if  we  arc  consid¬ 
ering  the  effect  of  removing  a  line  from  the 
band  2  group,  we  will  also  reoptimize  the 
band  5  group  in  the  case  where  band  2  traffic 
overflows  to  all  higher  numbered  groups.  Re¬ 
call  that  this  "oplimizatior"  lenerally  amounts 
to  little  more  than  checking  if  it  pays  to  add  a 
line  to  the  band  5  group. 

There  is  only  rvic  remaining  cause  for  con- 
(  ern  Suppose  we  had  a  problem  with  identi¬ 
cal  traihf  in  earh  band  and  we  began  with  a 
v*Iutiun  with  equ,il  sveci  groups  in  each  band 
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It  might  turn  out  that  it  doesn't  pay  to  increase 
or  decrease  the  size  of  any  group  by  a  single 
line,  but  that  a  total  consolidation  into  a  prop¬ 
erly  sized  band  5  group  is  the  optimal  solu¬ 
tion  This  is  the  general  problem  of  an  econ¬ 
omy  of  scale  existing  but  not  manifesting  itself 
until  a  critical  size  is  reached  This  has  not 
turned  out  to  be  the  case  in  any  of  the  specific 
problems  examined  so  far.  but  it  is  something 
to  watch  out  for.  One  protection  against  this  is 
to  start  the  optimization  off  first  with  a  balanced 
solution  and  then  with  an  all  band  5  solution. 
This  is  what  is  done. 

4.  Experimental  Results 

In  order  to  better  understand  the  optimiza¬ 
tion  procedure  and  the  nature  of  the  solutions 
to  the  WATS  optimization  problem  under  the 
new  tariff,  a  set  of  experiments  was  run.  The 
purpose  of  these  experiments  was  to  test  the 
behavior  of  the  cost  function  directly.  Thus, 
instead  of  simply  running  the  optimization 
procedure,  we  selectively  enumerated  solu¬ 
tions  in  order  to  more  thoroughly  examine 
the  solution  space.  Subsequently,  we  ran  the 
optimization  procedure  and  verified  that  it  did 
indeed  find  the  same  solutions. 

Table  3  gives  the  salient  input  data  common 
to  all  experiments.  The  time-of-day  profile  is 
assumed  to  be  identical  for  all  bands  (not  a 
realistic  assumption,  but  simple  to  input)  and 
is  given  in  Table  3A.  Only  daytime  traffic  is 
considered,  again  for  simplicity.  Table  IB 
gives  the  cost  for  WATS  lines;  it  is  taken  from 
the  tariff  for  New  York  City.  The  ODD  costs, 
also  estimates  for  New  York  City,  are  given  in 


Table  3.  Time-of-Oay  Profile 


Hour 

Fraction 

8 

.02 

9 

.08 

10 

.12 

11 

.10 

12 

.14 

1 

.18 

2 

.16 

3 

.11 

4 

.06 

5 

.03 

Table  1C  Table  ID  gives  the  relative  costs  by 
band.  One  can  observe  that  these  (actors 
relate  all  WATS  costs  and  DDD  costs.  All  ex 
periments  were  run  with  22  engineering  days ' 
month. 

The  first  experiment  had  1  Erlang/day  of 
traffic  in  each  band.  Table  4  summarizes 
some  of  the  configurations  explored  The 
configurations  in  Table  4  are  the  number  of 
lines  in  each  band.  Commas  between  the 
numbers  are  omitted  when  they  arc  single- 
digit  numbers.  First  note  that  tne  pure  band  5 
solutions  are  unimodal;  i.e.,  they  decrease  in 
cost  initially  and  then  steadily  increase  in  cost 
as  the  number  of  lines  increases  This  is  not 
particularly  surprising.  It  is.  however,  signifi¬ 
cant  as  an  incremental  procedure  like  the  one 
described  here  would  not  work  if  the  cost 
were  not  unimodal. 

Starting  from  the  best  pure  band  5  solution  — 
00002  (Configuration  3),  the  best  solution 
(<'9)  is  reachable  through  local  transformations 
via  the  configurations  3,  6.  and  9.  Suppose, 
however,  that  after  Configuration  6,  Configu- 


Table  4. 1  Erlang/Band 


Number 

Contig. 

Cost 

Bands 

1 

2 

3 

4 

5 

D 

1 

00000 

$2,889 

0 

0 

0 

0 

0 

2809 

2 

00001 

2,297 

0 

0 

0 

0 

1217 

1080 

3 

00002 

2,133 

0 

0 

0 

0 

1816 

317 

4 

00003 

2,153 

0 

0 

0 

0 

2080 

73 

5 

00004 

2,194 

0 

0 

0 

0 

2181 

14 

6 

00011 

2,075 

0 

0 

0 

1017 

745 

313 

7 

00101 

2,100 

0 

0 

822 

0 

853 

425 

8 

00111 

2,042 

0 

0 

822 

596 

506 

118 

9 

10011 

2,041 

3t6 

0 

0 

873 

640 

213 

10 

11111 

2,074 

316 

386 

410 

428 

450 

84 

11 

11011 

2,053 

316 

386 

0 

683 

534 

133 

12 

10111 

2,043 

316 

0 

652 

506 

473 

97 

u 
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rciiior\  cS  was  rcaclit.'d  This  would  happen  il 
we  tried  addiiuj  a  line  in  band  3  before  adding 
a  line  in  hand  1  Configuration  9  could  still  be 
reached  via  the  path  8,  9,  i  e  by  rolling  one 
line  from  bana  3  to  band  1  Configuration  9 
could  also  be  reached  from  a  balanced  solu¬ 
tion  like  Configuration  10  through  the  path 
10.  11.  and  9  Thus  we  see  that  Configuration 
9  IS  reachable  in  a  variety  of  ways  This  is  en¬ 
couraging.  We  do  not  know  if  Configuration  9 
is  optimal. 

While  we  cannot  be  sure  that  the  procedure 
will,  in  fact,  find  the  optimum,  the  data  in 
Table  4  is  very  encouraging  First,  we  see  that 
the  paths  to  the  best  solution  do  exist  Second, 
we  see  that  the  solution  space  is  very  flat 
Most  dead  ends,  if  any  exist,  are  likely  to  be 
within  1%  of  the  optimum. 

Similar  conclusions  can  be  drawn-  from 
Table  5A.  which  is  the  result  of  an  experiment 
with  10  Erlangs  per  day  per  band.  Here  again, 
the  best  solution.  (*171,  is  reachable  from  the 
band  5  solution  and  balanced  configuration 
(*15).  Here  again,  both  the  band  5  and 
balanced  solutions,  and  hence,  any  dead  ends 
reachable  fi-om  them,  are  within  1%  of  the 
optimum.  The  carried  load  and  cost/hour  are 
given  in  Table  5B  for  Configuration  17.  They 
indicate  an  increasing  cosi/hour  as  the  band 
increases  with  the  most  substantial  jump  be¬ 
tween  band  5  and  DDD  Other  experiments 
with  3  Erlangs/ day/band  and  30  Erlangs/ 
day/band  were  run  Tfie  results  weie  similar 
to  those  reported  here 


Table  SA.  10  ErlangrSand 


Number 

Configuration 

Cost 

1 

<i00C0 

$28,686 

2 

00005 

21,354 

3 

0000,10 

18.868 

4 

0000.11 

18.851 

5 

0000,12 

18.940 

6 

00066 

17,981 

7 

00336 

17.740 

8 

01236 

17.670 

9 

01245 

17,616 

10 

10236 

17.582 

11 

11136 

17,582 

12 

11226 

17.526 

13 

11235 

17,483 

14 

22222 

18,271 

15 

22225 

17,511 

16 

21226 

17  .'77 

n 

1 2:’.5 

17  .161 

Table  SB  Carried  Load  and  Cost/Hour 


Band 

Carried 

Load 

Cost 

Cost/hour 

1 

7.47 

S  2,169 

13.20 

2 

5.04 

1,565 

14.11 

3 

9.95 

3,242 

14.81 

4 

12.74 

4,463 

15.92 

5 

13.84 

5,467 

18.28 

DDD 

.96 

556 

26  26 

TOTAL 

50.00 

$17,461 

The  procedure  was  run  on  several  actual 
problems  using  data  derived  from  actual  call 
tapes  An  experienced  designer  then  attempted 
to  improve  the  solution  manually  using  the  in¬ 
teractive  system  within  which  the  procedure 
was  embedded  In  two  cases  an  enumeration 
was  run.  automatically  stepping  through  all 
possible  solutions.  In  no  case  was  any  im¬ 
provement  observed  Thus,  while  we  have  no 
formal  proof  that  the  procedure  converges  to 
an  optimal  solution,  this  empirical  evidence, 
together  with  the  essentially  unimodal  nature 
of  the  cost  function,  leads  us  .a  believe  that  it 
does  in  fact  yield  globally  optimal  solutions  in 
most  cases. 

5.  Extensions  and  Further  Conclusions 

The  multiple  hour  nature  of  the  problem 
does  not  seem  to  have  much  of  an  effect  on 
the  procedure.  If  an  off-peak  hour  has  traffic 
similar  to  the  peak  hour,  the  optimization 
should  be  unaffected  If  an  off-peak  hour  has 
much  less  traffic .  most  of  its  traffic  will  be  car¬ 
ried  in-band  Neither  situation  seems  to  cause 
a  problem. 

FX  and  tie  lines  could  be  handled  by  increas¬ 
ing  the  number  of  rows  in  the  traffic  matrix  to 
include  candidate  FX  and  tie  lines.  Similarly, 
the  procedure  could  be  extended  to  handle  a 
more  general  class  of  overflow  rules  where 
traffic  from  one  band  is  restricted  to  overflow 
only  to  specific  oiher  bands  In  some  cases, 
where  the  number  of  overflow  options  is 
liinitcrl.  an  explicity  routing  table  would  have 
to  be  entered 

O'-'eueing  arul  blocking  solutions  could  be 
produced  hv  replacing  ih-c  Erlang  B  formula 
by  one  ,’.|ipropr;.ite  to  the  situation  The  con 
ri'i'K.ns  i'.i.iwn  tiom  the  evperimert''  above 
-.re  piiinipaily  reiulls  <d  Ine  fiinition.  not 
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the  Erlany  B  function  Tfiey  are  thus  likely  to 
hold  in  the  other  situations  as  well 
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ABSTRACT 

Ve  address  the  problem  of  designing  a  circuit- 
switched  network  for  voice  communications  operating  in  a 
military  environment.  The  circuit-switched  network 
design  problem  may  be  briefly  stated  as:  given  the 
topology,  route  tables  and  control  discipline,  end-to-end 
offered  traffic  and  performance  requirements,  determine 
trunk  group  sizes  su^  that  the  requirements  are  satisfied. 
One  of  tte  key  requirements  of  a  design  is  that  the 
network  be  survivable,  where  survivability  is  based  on 
different  destruction  scerurio  conditioru.  Our  objective 
Is  to  guarantee  an  acceptable  level  of  performance  for 
every  rode  pair  and  under  each  of  the  different 
antidpated  damage  scenarios.  The  main  contributions  of 
our  present  work  are  the  development  of  approaches  for 
designing  networks  tliat  simultaneously  satisfy 
performance  requirements  for  different  destruction 
scenarios. 

Ve  desaibc  the  architecture  of  a  survivable,  dreuit- 
switched  network.  The  key  characteristics  of  the 
survivable  network  design  problem  are  highlighted  and 
differences  with  respect  to  dassical  trunk  sizing  problem 
arc  pointed  out.  An  important  aspect  of  our  work  is  that 
the  sizing  is  based  on  the  logical  topology  of  the  network 
rather  than  its  trunk  group  topology.  One  design 
approach,  which  we  have  used  successfully,  is  presented  in 
detaU. 

1.  WTRODUenON 

In  this  paper,  we  address  the  problem  of  designing  a 
circuit-switched  network  for  voice  communications 
operating  in  a  military  environment.  The  drcuit-switched 
network  design  problem  may  be  briefly  stated  as:  given 
the  topology,  route  tables  and  control  disdpline,  end-to- 
end  offered  traffic  and  performance  requirements, 
determine  trunk  group  sizes  such  that  the  requirements 
are  satisfied.  One  of  the  key  requirements  of  a  design  is 
that  the  network  be  survivable,  where  survivability  is 
based  on  different  destruction  scenario  conditions. 
Survivability  constraints  on  a  network  design  include:  a) 
connectivity  requirements,  and  b)  end-to-end  performance 
requirements. 

Satisfying  connectivity  requirements  is  a  classical  and 
relatively  well  studied  problem.  The  main  contributions 
of  our  present  work  are  the  development  of  approachies 
for  designing  networks  that  simultaneously  satisfy 
performance  requirements  for  different  destruction 
scenarios.  Our  objective  is  to  guarantee  an  acceptable 
level  of  performance  for  every  node  pair  and  under  each 
of  the  different  anticipated  damage  scenarios.  This 
objective  is  more  general,  and  in  our  opinion  more 
ap^opriate,  than  the  goal  of  ensuring  connectivity 


requirements.  Ve  avoid  the  pitfall  of  designing  networks 
that  satisfy  connectivity  constraints,  but  exhibit 
unacceptably  poor  performance.  An  important  aspect  of 
our  work  is  that  the  design  is  based  on  the  logical 
topology  of  the  network  rather  than  its  trunk  group 
topology. 

The  optimal  design  of  circuit-switched  networks  for 
voice  communications  is  complicated  due  to  a  variety  of 
reasons.  In  order  to  appreciate  this  fact,  we  first  review 
the  salient  characteristics  of  military  networks  that 
distinguish  it  from  commercial  telephony.  They  typically 
have  a  mesh  connected  topology  (switch  interconnections) 
and  use  non-hierarchical  alternate  routing  schemes. 
Other  important  operational  features  that  are  unioue  to 
military  networks  include  the  presence  of  multiple  levels 
of  traffic  priority,  preemption  of  calls,  provision  for 
multiple  route  search  methods  (normal,  preemptive,  and 
hotline).  The  main  End-To-End  (ETE)  performance 
measures  of  interest  to  a  network  designer  are:  Grade  Of 
Service  (COS),  Probability  Of  Preemption  (POP),  and 
Speed  Of  Setup  (SOS)  of  calls. 

In  this  paper,  we  focus  attention  on  effective  design  of 
networks  that  satisfy  ETE  COS  performance  requirements 
for  multiple  scenarios  (prespecified  by  the  user).  In  the 
next  section,  we  introduce  relevant  terminology  and 
explain  the  topological  representation  of  the  network.  In 
Section  3,  the  key  characteristics  of  the  survivable 
network  design  problem  are  pointed  out.  In  the  final 
section,  we  ^velop  approaches  for  designing  survivable 
networks.  One  such  approach,  which  we  have  used 
successfully,  is  presented  in  detail. 

It  is  worth  noting  that  recent  literature  [1]  and  [2] 
contain  approaches  to  the  dimensioning  of  networks  for 
more  than  one  busy  hour.  These  multi-hour  sizing 
algorithms  address  a  problem  conceptually  similar  to  ours, 
in  the  sense  that  requirements  of  each  busy  hour  may  be 
thought  of  as  a  scenario.  However,  the  methods  are 
largely  inapplicable  to  us  because  they  consider  networks 
Commercial  telephony)  with  hierarchical  routing,  while 
our  interest  is  in  dimensioning  networks  with  non- 
hierarchicaJ  routing. 

2.  TOPOLCXIICAL  REPRESENTATION 

In  this  section,  we  fix  relevant  terminology  and 
corK:epts.  Ve  also  elaborate  briefly  on  the  architecture 
of  a  typical  survivable  voice  communication  network  and 
set  the  framework  for  subsequent  sections  of  the  paper. 
Our  discussion  will  focus  on  the  backbone  non-hierarchical 
switched  part  of  the  overall  communications  system. 
Users  are  allowed  to  be  connected  via  dedicated  local 
loops  to  the  switch  sites  and  routed  via  the  backbone 
switched  network  to  the  destination  handset. 
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The  backbone  topology  of  a  survivabir  network  is  dealt 
at  three  level:  of  detail.  These  are:  1)  physical  topology, 
2)  logical  topology,  and  3)  trunk  group  tspology.  A 
physical  topjloKV  is  comprised  of  a  set  of  physical  sites 
(e.g.,  switch  sites,  radio  relay  stations),  and  physical  links 
(e.g.,  microwave  links)  as  depicted  in  Figure  1.  The 
physical  elements  (sites  and  links)  are  the  targets  of 
destruction  under  enemy  attack. 
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FIGURE  2;  LOGICAL  TOCOLOGY  AND  LOGICAL 
LINK  COMPOSITION 


A  logical  link  in  a  network  Is  a  path  in  the  physical 
network  between  two  swirch  sites  without  intervening 
switching.  A  logical  link  is  considered  operational  (i.e., 
not  destroyed)  if,  and  only  if,  all  the  physical  elements 
composing  it  are  operational.  A  oair  of  switches  may  be 
connected  by  more  than  one  logical  link.  If  such  is  the 
case,  the  logical  links  represent  conceptually  different 
communication  paths  between  the  two  switches.  .1 
multiple  logical  links  exist  between  a  switch  pair,  there 
may  ^  physical  elements  (other  than  the  terminal  switc.h 
sites)  common  to  hvo  or  more  ol  the  logical  links.  The 
logical  topology  of  a  network  consists  of  a  set  of  switch 
sites  and  logical  links  connecting  site  pairs. 

In  Figure  2,  we  show  a  logical  topology  of  the  network 
whose  physical  topology  is  given  in  Figure  I.  There  are 
several  potential  logical  topologies  one  can  defi.ts  wiin 
respect  to  the  giver  physical  netwsrk  topology,  .snd 
Figure  2  shows  just  one  of  the  choices. 


A  trunk  group  between  two  switch  sites  consists  ol  all 
the  logical  links  that  connect  the  site  pair.  II  na  logical 
link  con.nects  a  site  pair,  then  there  exists  no  trunk  group 
between  the  site  pair.  The  trunk  erouc  tcaoloev  of  a 
network  consists  of  a  set  of  switch  sites  and  uuok  groups 
connecting  site  pairs  as  shown  in  Figure  ).  A  route 
between  a  switch  pair  is  specified  as  a  sequence  of  trunk 
groups. 

Connectivity  requirements  on  a  design  stipulate  ilsat 
for  each  switch  pair,  there  must  be  at  least  a  prespeciiied 
number  of  physically  disjoint  paths  in  trie  physical 
topology  connecting  the  switches.  Connectivity 
constrair.ts  are  tooolcgicsi  in  nature  and  are  taken  inic 
accouiU  in  selecting  tne  iogical  topology  and  routes  for 
diifeier.t  switch  p.\irs. 
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FIGURE  3:  TRUNK  GROUP  TSPOLOGY 


Associated  with  each  physical  link  is  its  maximum 
capacity.  It  is  expressed  as  the  number  cf  telephone  calls 
that  the  link  can  transmit  at  the  same  time.  The  size  (S|) 

of  logical  link  /.  is  the  number  of  trunks  it  contains,  and 

cannot  exceed  the  capacity  of  minimum  of  component 
physical  links  in  it.  The  size  of  a  trunk  group  is  the  sum 
of  the  sizes  of  logical  links  that  comprise  it.  Figures  1 
through  3  indicate  the  sizes  of  a  hypothetical  design. 

The  cost  c.  (doUars/month)  of  logical  link  consists 

of  two  components:  1)  a  fixed  cost  (b.)  for  establishing 

the  logical  link,  and  2)  a  variable  cost  which  is 
proportional  to  its  size.  If  the  cost  per  circuit  in  is  a., 

the  total  cost  c.  is  equal  to  (b.  «  a.s.).  The  specific  choice 

of  a  logical  topology  determines  the  fixed  costs  of  the 
network.  The  total  cost  of  the  network  is  the  sum  cf  its 
fixed  costs  and  variable  costs.  The  design  problem, 
considered  in  the  subsequent  sections,  is  concerned  with 
the  determination  of  a  feasible  set  of  logical  link  sizes 
such  that  the  variable  costs  (i.e.,  trunk  costs)  are 
minimized.  Although  not  addressed  in  this  paper,  the 
designer  interested  in  minirriizing  the  total  cost  of  the 
network  must  consider  the  choice  of  logical  topology  as 
part  of  the  overall  design  problem. 

3.  DESIGN  PROBLEM 

The  classical  network  design  problem  may  be  stated  as 
follows.  Given  the  trunk  group  topology,  ollered  ETE 
traffic  and  performance  '.GOS)  requirements,  v,d  routes 
and  route  control  discipline,  determine  a  set  of  trunk 
group  sizes  such  that  traffic  and  perfccmance 


requirements  aue  satisfied.  The  above  problem, 
(henceforth  referred  to  ;n  this  paper  as  the  Trunk  Suing 
Problem)  requires  the  sizing  of  trunk  groups  for  a  sinzle 
scenario.  A  scenario  is  an  operating  condition  of  the 
network  characterized  by  its  unique  set  of  traffic  and 
performance  requirements. 

The  classical  Trunk  Sizing  Probleni  in  networks  with 
non-hierarchicaJ  routing  has  been  well  studied  in  the 
literature  [3]  and  [<*]  .  Segal's  [3]  algorithm  is  based  on 
single-moment  assumption  (i.e.,  calls  offered  to  each 
trunk  group  have  a  Poisson  distribution)  and  is  applicable 
for  networks  with  e'ther  originating  office  control  or 
successive  office  control.  Covo  1.“]  considered  the 
presence  of  multiple  precedence  levels  of  traffic  and 
developed  an  approximate  method  for  sizing  irunK  groups. 
However,  Covo's  work  addresses  the  requirements  of  only 
a  single  scenario.  For  our  computational  studies,  we  used 
Segal's  algorithm  for  feasible  trunk  group  sizing.  We  do 
not  repeat  Segal's  algorithm  here,  but  refer  the  reader  to 
[3]  for  details. 

The  problem  of  designing  a  survivable  network 
(henceforth  referred  to  as  SND  Problem)  requires  a  sizing 
of  trunk  groups  and  logical  links  that  satisfies  the  traffic 
and  performance  requirements  under  multiple  operating 
conditions  of  tlie  network.  This  is  the  most  important 
characteristic  that  distinguishes  the  probism  from  tfie 
Trunk  Sizing  Problem.  Another  unique  c.haracteristic  c£ 
the  SND  problem  is  the  dichotomy  between  trunk  group 
sizing  and  logical  link  sizing.  The  key  aspects  of  this 
distinction  between  trunk  group  sizing  and  IcgicaJ  link 
sizing  are  as  follows.  In  classical  studies,  a  physical  link 
is  synonymous  with  a  trunk  group  and  destructicn  of  a 
physical  link  or  a  s'witch  would  result  in  the  total 
destruction  of  any  route  that  uses  it.  Ho'wever,  for  a 
survivable  network,  the  effect  of  destroying  a  physical 
element  (link  or  site)  has  the  lolbwmg  ramifications.  All 
logical  links  tnat  use  the  physical  elcme.tt  are  destroyed. 
The  loss  ol  a  logical  link  would  result  in  reducing  the  size 
of  the  trunk  group  that  contains  the  logical  link.  Thus, 
the  size(s)  of  one  or  more  trunk  group(s)  will  be  reduced. 
The  effect  on  a  route  affected  by  the  physical  element 
destruction  is  one  ol  the  following:  1)  a  reduction  in  size 
of  some  trunk  group(s)  belonging  to  the  route,  or  2)  total 
destruction  of  the  route.  The  latter  occurs  when  ah  the 
logical  links  composing  any  trunk  group  oeionging  to  the 
route  are  destroyed.  UnJikc  thic  ciassicil  Trur.K  Sizing 
Problem,  the  loss  ol  a  physical  element  does  not 
necessarily  imply  the  total  destruction  of  one  or  more 
.*outes. 

We  illustrate  the  above  facts  with  an  example. 
Consider  the  network  whose  iogicaJ  tcpoicgy  and  iogicai 
link  composition  are  shown  in  Figure  2.  The  impact  of 
destroying  relay  site  Rj  (see  Figure  1)  would  be  to  oestroy 

logical  links  L^g,  Lg  and  that  use  Rj  (see  logical  link 

composition  in  Figure  2).  These,  in  turn,  imply  a 
reduction  in  size  ol  trunk  group  Tj^  (containing  L^g)  from 

32  to  20.  It  can  be  observed  that  size  of  trunk  group  T. 

(containing  L^y^)  is  reduced  from  33  to  23,  while  trunk 

group  T^  (containing  a  single  logical  link  L^)  is  totally 

destroyed  (size  reduced  to  zero).  Thuv  the  only  routes 
totally  disrupted  would  be  those  cont.unmg  T^,  wruie  the 

rest  ol  the  routes  are  likely  to  operate  at  a  derraoeo 
performance  level. 

For  a  given  scenario,  the  ETC  COS  performance  of  a 
network  is  a  function  of  the  trunk  gro-o  topclcf,*,  trunk 
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group  lizes,  routes,  an'J  ETE  offered  traffic  of  that 
scenario.  However,  in  designing  a  survivable  network,  we 
have  to  size  the  togical  links  such  that  for  each  scenario, 
the  trunk  group  sizes  (as  determined  by  the  surviving 
logical  links)  are  adequate  to  meet  the  requirements  cl 
the  scenario. 

The  specification  of  the  design  problem  must  precisely 
characterize  the  scenarios  of  interest.  As  an  example, 
our  interest  is  in  sizing  a  network  with  the  following 
requirements.  Under  a  normal  scenario  in  which  no 
network  elements  are  destroyed,  the  network  must  be 
capable  of  satisfying  a  nominal  set  of  offered  traffic  and 
performance  requirements.  The  performance  requirement 

is  G  ,  the  maximum  tolerable  end-tO’Cnd  COS.  The 
n>4x 

value  of  G _ could  be  0.01.  Under  an  overload  scenario, 

maX 

rto  network  elements  are  destroyed,  but  the  offered 
traffic  between  each  node  pair  increases  to  some  c  (say, 
c  X  l.S)  times  the  nominal  offered  traffic  between  tlw 
pair.  The  network  sizing  must  satisfy  the  overload  traffic 
requirement  at  a  less  stringent  performance  level  (say, 

G  s0.10).  In  addition,  we  require  that  whenever 
mu 

exactly  one  physical  element  is  destroyed,  the  surviving 
network  must  be  capable  of  handling  the  nominal  traffic 
requirements  at  a  degraded  performance  level  (say, 
■  0.1S).  Vt  note  that  for  the  physical  network 

nflAX 

shown  in  Figure  1,  there  arc  21  physical  elements 
M  switches,  4  relays,  and  1 1  links)  and  corresponding  to 
each  clement  a  damage  scenario  is  identified. 

In  the  next  section,  we  discuss  Two  basic  design 
philosophies  for  realizing  feasible  and  near  minimum  cost 
survivable  network  designs. 


(1  £  j  <  $),  identify  the  trunk  group  topology,  surviving 

routes,  ETE  offered  traffic,  and  oerfor'-nsner 
requirements.  Apply  Segal's  algorithm  [J]  and  compute  a 
feasible  set  of  n'unk  group  sizes  satisfying  the  ETE 
requirements  of  scenario  j.  Let  a^j  be  the  oesired  size  lor 

trunk  group  i  (1  <  i  <  t,  where  t  is  the  total  numoer  ol 
trunk  groups  unde7  the  normal  sceriano)  in  scenario  j. 

Step  2  requires  the  calculation  of  logical  link  sizes 
based  on  the  trunk  group  sizes  (computed  in  Step  1)  sc 
that  the  total  monthly  dollar  costs  are  minimized. 

"f/t  devote  the  rest  of  this  Section  (4.1)  to  address 
Step  2.  Consider  a  specific  trunk  group,  say  i.  Let  the 
trunk  group  contain  m  logical  links  and  let  n.^^^  represent 

the  number  of  circuits  in  logical  link  k  of  trunk  grevo  i. 
Further,  let  Xj.j^  be  a  binary  variable  defined  as  follows: 


p  if  logical  link  k  of  trur.k  group  i  is 
I  destroyed  in  scenario  j 


xhk 


1(1  <  J  <  I,  1  <  k  <  m) 
otherwise 


Let  b.|^  be  the  cost  per  circuit  in  logical  link  k  in  the 

trunk  group  i.  The  minimum  cost  logical  link  assignment 
(MCLLA)  problem  is  stated  as  follows: 


Minimize: 


t  m 


EE 

ixlkxl 


‘“ik^ik 


(1) 


4.  DESIGN  APPROACHES 

Wc  consider  two  basic  philosophies  for  sizing 
survivable  networks.  We  assume  the  following  are  given  • 
physical  topology,  logical  topology  and  logical  link 
composition  (under  normal  scenariol,  routes,  physical  link 
sizes,  a  precise  cnaracterization  of  the  scenarios  and 
their  requirements.  The  objective  is  to  compute  logical 
link  sizes  such  that  the  requirements  of  each  scenario  are 
satisfied.  Two  basic  design  approaches  we  considered  are: 

L  PvalJel  Approaches. 

Z  Serial  Approaches. 

4.1  Parallel  Approaches 

Suppose  a  network  requires  sizing  of  logical  links  and 
Wunk  groups  to  satisfy  traffic  and  performance 
requirements  for  s  different  scenarios.  The  basic 
philosophy  of  a  parallel  approach  is  that  a  feasible 
survivable  design  may  be  obtained  by  independent 
•olutloni  of  many  instances  of  the  trunk  sizing  problem 
(one  instance  for  each  scenario)  and  then  an  appropriate 
combination  of  the  solutions  to  determine  the  logical  link 
alzes.  Thus,  the  two  major  steps  in  the  parallel  approach 
are: 

Step  It  Determining  Trunk  Group  Size 
Requirementt. 

Step  2i  Assign  Capacity  to  Logical  Links  to  Satisfy 
'  Trunk  Group  Size  Requirements. 

Step  1  is  accomplished  as  follows.  For  each  scenario  j 


Subject  to;  m 

I,  *'»■  (2. 

The  objective  function  (1)  represents  the  total  cost  of 
orcuits  in  all  logical  links  of  the  given  logical  topology. 
There  are  s  linear  inequalities  in  (2)  (cr  each  trunk 
group  i,  and  their  interpretation  is  as  follows.  For 
scenario  j  and  trunk  group  i,  the  totil  number  of  circuits 
that  survive  (given  by  the  left  side  of  the  inequality)  must 
be  at  least  equal  to  a.|,  the  desired  size  for  trunk  group  i 

under  scenario  j.  The  constraints  on  the  design  problem 
may  mandate  a  minimum  size  requirement  (/|^1,  and 

■maximum  size  limit  (U||^)  for  each  logical  link  k.  These 

cerstraints  are  given  by  (3). 

In  addition  to  constraints  (2)  and  (3)  that  are  specific 
to  trunk  group  i,  there  are  constraints  on  the  number  of 
circuits  that  can  be  used  in  a  physical  link.  If  s  number  of 
logical  links  (possibly  belonging  to  many  different  trunk 
groups)  use  a  physical  link,  the  combined  circuit 
requirement  of  the  logical  links  using  the  physical  link 
cannot  exceed  the  maximum  specified  limit  for  the 
physical  link.  These  constraints  must  be  handled  by  the 
logical  link  assignment  algorithm. 

The  MCLLA  problem  formulated  above  could  be  ap¬ 
proached  via  an  integer  linear  prog'amming  'ILP) 
algorithm.  However,  rather  th.in  applying  a  computa¬ 
tionally  expensive  ILP  algorithm,  we  have  cevel&pec 
following  heuristic  algorithm  to  achieve  a  inearl  minimum 
cost  design. 
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Our  approach  is  based  on  decomposition  of  objective 
function  (1)  into  t  separable  components  (one  (or  each 
trunk  group)  and  minimizing  each  component  incividually. 
The  t  trunk  groups  are  assigned  to  their  respective  logical 
links  by  considering  them  sequentially.  Suppose  Oj.  e,...., 

e,  is  a  sequence  in  which  the  trtetk  groups  are  considered 

lor  logical  link  assignment.  When  the  logical  links  of 
trunk  group  e.  (1  <i  m)  are  assigned,  the  sizing  deciuons 

made  earlier  for  logical  links  of  trimk  groups  e^. 
ejfxCj  I  ^  treated  as  constraints.  These 

constraints  arise  due  to  the  fact  that  (i)  physical  links 
may  be  shared  by  logical  links  belonging  to  different  trunk 
groups  and  (ii)  upper  bounds  on  size  of  physical  links. 

An  outline  of  the  steps  of  the  MCLLA  assignment 
algorithm  is  given  below. 


Step  a: 

Choose  a  sequence  in  which  trtetk  groups  arc 
to  be  considered  for  logical  link  assignment. 
Without  loss  of  generality,  let  this  sequence 
be  l,2,...,t. 

Step  b: 

i  ♦  1. 

Step  c 

Assign  circuits  to  logical  links  of  trunk 
group  i. 

Step  d> 

i  *  i  ♦  1. 

Step  e: 

If  i  >t  then  stop;  else  repeat  Step  c. 

MCLLA  Alzorithm 

Step  a  of  thie  MCLLA  algorithm  is  accomplished  as 
follows.  There  is  a  large  number  (t!)  of  possible  sequence, 
in  which  the  trunk  groups  couid  be  considered  for  logical 
link  assignment.  Since  the  final  solution  (total  cost  of 
logical  link  circuits)  depends  on  the  sequence  in  which 
trvnk  groups  are  considered,  we  consider  trunk  groups  in  a 
sequence  determined  by  decreasing  order  of  their  size 
requirement.  For  each  trvr.k  group  i(l  <  i  <  t),  compute 
the  variable  h|  (highest  requirement  of  trunk~ group  i  over 

all  scenarios). 

r“<.  H 

Sort  the  h^'s  in  non-increasing  order.  Consider  the  trunk 

groups  for  logical  link  assignment  in  the  sequence 
determined  by  the  above  sort  step. 

We  now  consider  Step  c  in  the  MCLLA  algorithm  and 
describe  how  the  algorithm  works  in  meeting  the  size 
requirements  of  a  specific  trunk  group  i.  The  algorithm 
first  considers  scenarios  in  whicn  only  one  logical  link  of 
trunk  group  i  survives,  then  it  considers  scenarios  in  which 
two  logical  links  survive,  and  so  forth.  In  assigning 
circuits  to  fulfill  the  requirements  of  scenario  j,  suppose 
there  is  a  choice  to  assign  arcuits  among  nvo  or  mere 
logical  links.  The  assignment  algorithm  assigns  arcuits 
first  to  the  least  expensive  (per  circuit  cost)  iogicaJ  link, 
in  that  scenario.  The  assignment  is  made  until  either 

of  the  following  two  conditions  becomes  true: 

I.  All  the  circuit  requirements  of  scenario  j  have 
been  fulfilled. 


2.  The  maximum  limit  on  the  capacity  of  some 
physical  link  p  in  logical  link  is  reached  or  tiic 

upper  bound  on  the  size  of  is  reached. 


if  condition  (i)  becomes  true,  then  we  are  done  with 
the  assignment  meeting  tlie  requirements  cf  scenario  j.  if 
condition  (2)  becomes  true,  then  the  renaming  arcuit 
requirements  of  scenario  j  are  fulfilled  by  considering  the 
second  least  expensive  logical  link/2  that  survives  in 

scenario  j.  If  necessary,  the  algorithm  also  considers  :ne 
third  least  expensive  logical  link,  and  so  forth. 

For  the  purpose  of  illustration  of  Step  z,  consider  the 
following  example.  Let  a  trunk  group  have  m  (s4)  logical 
links  which  must  be  sized  to  meet  the  trunk  group's 
requirements  in  s  k-i)  scenarios.  Let  the  required  sizes  of 
the  trunk  group  be:  a^  =  59,  82  »  ^7,  a^  :  61,  a^  =  <*0, 

a^  s  85.  Let  the  cost  per  circuit  in  the  logical  links  be: 

b|  s  10,  b2  ~  20,  bj  3  8,  b^  s  12.  Let  the  upper  bound  for 

each  logical  link  be  50  and  the  lower  bound  for  each  be  0. 
Let  X,  shown  in  Table  1,  be  a  5  x  4  matrix  such  that  the 

(i«k)*^  entry  is  1  (or  0)  depending  on  whether  logical  link 
k  (I  <  k  <  4)  survives  (or  fails)  in  scenario  j  d  <  j  <  5)‘  The 
iogic^  link  sizes  n^^  (1  <  k  <  4)  are  computeo  as'foriows: 

TABLE  1:  WftTRlX  X 

LOGICAL  LINK  K 


SCENARIO  J 

1 

2 

3 

4 

5 


12  3  4 

10  10 

0  1  1  0 

10  0  1 

0  0  0  1 

1111 


First,  scenario  4,  with  only  one  surviving  logical  link 
(/^)  is  considered.  Its  size,  n^,  is  set  to  40,  the  requirec 

trunk  group  size  in  scenario  4.  Of  the  remaining 
scenarios,  scenarios  1.2,  and  3  have  two  surviving  logical 
links  each  and  scenario  5  has  all  four  logical  links  in  tact. 
The  constraint  on  scenario  1  is  that  logical  links  /j  and  ^ 

be  sized  such  that  their  combined  size  (hj  *  n^)  is  >  19. 

Since  /j  is  cheaper  than  circuits  are  assigned  to 

without  violating  the  upper  bound  constraint  on  ;ts  size. 
Hence,  n,  is  set  at  the  maximum  permissible  value  of  50. 

Since  this  (jUIs  short  of  the  required  size  by  9  circuits,  the 
value  of  nj  is  set  to  9.  For  scenario  2,  (nj  »  Oj)  must  be 

>57.  The  algorithm  first  considers  /j  which  is  cheaper 

than  Since  /,  is  at  its  maximum  permissible  size, 

circuits  are  assigned  to  the  second  least  exoensive  logical 
link,  /j.  Oj  is  set  to  7.  The  algorithm  proceeos  ..1  a 

similar  manner  and  ends  up  with  the  lolioving  logical  link 
sizes: 

nj  3  21,  n2  >  7,  rij  X  50,  n^^  s  4C. 


6 


*.2  Sei  iai  Approaches 

The  basic  philosophy  o!  a  serial  approach  is  that  a 
survivable  network  design  may  be  obtained  by 
considering  the  requirements  of  each  scenario  in 
sequence.  The  initi^  goal  is  to  identify  the  scenario 
whose  requirements  are  the  rncst  dominant  (according  to 
a  criterion,  well-defined  by  the  designer)  over  ejl  tne 
scenarios.  The  trunk  groups  are  ci..ed  and  the  required 
trunk  group  circuits  are  apportioned  to  the  logical  Unks 
that  survive  in  the  most  dominant  scenario.  When  she 
requirements  of  a  nori-dominant  scenario  j  are 
considered,  a  baseline  design  is  first  derived  from  a 
knowledge  of  sizes  of  logical  links  dimensioned  prior  to 
scenario  j.  The  requirements  of  scenario  j  are  sought  to 
be  fulfill^  by  incremental  rr.odifications  with  respect  to 
the  baseline  ^sign. 

Despite  the  intuitive  appeal  of  the  serial  approach  in 
designing  networks  with  nearly  "similar"  scenarios,  there 
are  a  number  of  disadvantages  to  it.  Our  assessment  is 
that  the  accuracy  of  the  serial  approach  is  quite 
sensitive  to  the  sequence  in  whi(±  scenvios  are 
considered.  Furthermore,  the  accuracy  depends 
significantly  on  the  use  of  effective  algorithms  for 
incremental  sizing  (from  one  scenario  to  the  next). 
Effective  algorithms  for  nargmai  sizing  are  known  in 
the  context  of  hierarchical  networks  [5] ,  though  not  ss 
much  is  known  for  the  rion-hiera/6.i<-ai  routing  network's 
of  interest  to  us.  A  third  and  very  significant 
disadvantage  of  the  serial  approach  is  i»  extremely  hign 
computational  time  requirement  when  compared  to  the 
parallel  approach.  When  a  baseline  design  is 
incrementally  modified  to  meet  the  requirements  of  a 
new  scenario,  the  ETE  grade-of-service  must  be 
computed  to  verify  if  indeed  the  requiremenu  have  been 
satisfied.  Thus  the  serial  approach  requires  many 
computationally  expensive  perforinance  analysis  runs  as 
part  of  its  ircremental  sizing  steps.  From  our 
computational  experience,  we  nave  found  the  parallel 
approach  to  yield  good  feasible  designs  within  reasonable 
computer  time. 
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Delay  and  Overhead  in  the  Encoding  of  Data  Sources 

J.  F.  HAYES.  sbNiok  mlmmf.ii.  ibF.b.  and  R  K.  BOORSTYN,  memu*.  ieee 


Absiraci — A  basic  properly  of  data  soorccs  la  laltractlv*  appU* 
rations  is  bursliness.  i.c.,  short  periods  of  aclirity  foUooed  by  loof 
idle  periods.  In  these  same  applications  message  delay  is  the  primary 
performance  criterion.  The  combination  of  bnrsty  flan  and  a  delay 
criterion  leads  to  a  source  encoding  problem  in  which  delay  plays  a 
central  role.  A  salient  feature  of  Ibis  problem  is  that  there  la  a  tradeoff 
between  delay  and  the  number  of  protocol  bits  required  to  represent 
the  slate  of  the  source. 

A  simple  model  of  a  bursty  source  is  studied  with  the  objective  of 
understanding  the  relationship  between  coding  cfficicBcy  and  delay. 
Two  encoding  schemes,  a  block  encoding  Icchnitiue  and  a  technique 
employing  flags,  are  esamined  in  some  detail.  For  bath  the  block 
encoding  and  the  Hag  schemes,  a  significant  result  it  that  as  the  source 
becomes  lets  bursty,  delay  grows  without  bound.  This  result  is 
obtained  in  spite  of  the  fact  that  both  schemes  are  reasonable  and  in 
the  limit  the  encoding  problem  disappears.  It  also  appears  that  the 
flag  encoding  technique  hat  much  smaller  delay  than  black  encoding. 

INTRODUCTION 

HIS  paper  is  concerned  with  the  interplay  of  delay  and 
overhead  in  the  transmission  of  data  traffic  over  a  syn* 
chronous  channel.  The  relationship  of  these  quantities  is  due 
to  certain  unique  properties  of  data  in  an  interactive  environ¬ 
ment.  DaU  traffic  is  bursty,  i.e.,  there  are  short  busy  periods 
interspersed  with  relatively  long  idle  periods  which  contain  no 
information.  (This  stands  in  contrast  to  voice  and  video, 
where  pauses  or  blanks  may  be  significant.)  The  second  rele¬ 
vant  property  of  dau  is  the  effect  of  delay.  In  many  applica¬ 
tions  dau  traffic  it  highly  interactive,  placing  a  premium  on 
rapid  response  times.  The  relationship  of  these  quantities  al¬ 
lows  one  to  formulate  a  source  encoding  problem  which  con¬ 
siders  delay  as  a  basic  component.' 

Clearly,  the  long-term  average  rate  of  data  generation  at  the 
source  must  be  lest  than  the  capacity  of  the  synchronous 
channel.  However,  long-term  averaging  will  lead  to  a  violation 
of  the  delay  requirement  and  a  reduced  encoding  interval  is 
indicated.  Short-term  encoding  leads  to  increased  overhead 
since  there  will  be  timet  when  there  is  nothing  to  transmit:  a 
source  state  which  mutt  be  conveyed  to  the  receiver  unambig¬ 
uously.  In  the  sequel  tJie  problem  of  striking  a  balance  be¬ 
tween  overhead  and  delay  it  studied  for  a  basic  source  model. 
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>  The  cciinection  of  these  tsvo  quantities  hat  been  discussed  by 
Lam  1 1 1  in  another  context. 


To  1  large  e.xtent  the  rule  of  delay  in  this  context  has  not 
been  considered  by  information  theorists.  The  earliest  related 
work  wu  done  by  Jelinek  (2)  and  wu  continued  in  collabora¬ 
tion  with  Schneider  (3] ,  [4] .  The  model  we  consider  is  de¬ 
picted  in  Fig.  1 .  A  source  produces  symbols  at  a  fixed  rate. 
If  the  symbols  produced  by  the  source  are  not  equally  prob¬ 
able,  then  by  using  variable  length  codes,  such  u  Huffman 
codes,  the  average  number  of  bits  that  need  to  be  transmitted 
over  the  channel  in  a  time  interval  can  be  minimized.  In  order 
to  implement  this  technique  buffering  is  required  since  the 
channel  operates  at  a  fixed  rate  and  the  bits  are  produced  by 
the  encoder  at  a  variable  rate.  Jelinek  and  Schneider  studied 
the  relationship  between  the  probability  of  buffer  overflow 
and  efficient  source  encoding. 

We  shall  consider  a  simple  model  of  a  source  which  em¬ 
bodies  these  basic  quantities.  Several  techniques  for  encoding 
this  source  are  studied  with  the  focus  upon  delay  rather  than 
storage.  A  particular  technique  using  flagg  is  examined  in 
derail.  Related  work  on  the  topic  hat  been  done  by  Gallager 

(5] ,  who  applied  rate  distortion  theory  to  the  coding  prob¬ 
lem  with  delay  at  the  distortion  measure.  Recently  Humblet 

(6]  hat  considered  the  problem  of  encoding  randomly  arriving 
messages  to  at  to  minimize  delay. 

SOURCE  MODEL 

Consider  the  following  model:  a  source  produces  either  a 
blank  or  a  binary  digit  once  every  second  with  the  following 
probabilities. 

J»tblankJ  -  1  -P 

J*  [0  output)  «^/2  (I) 

J*)!  output]  mPI2. 

Successive  tymboit  are  independent.  The  sequence  of  O’t  and 
I’t  produced  by  this  source  it  to  be  carried  over  a  synchronous 
error-free  binary  channel  with  a  capacity  of  one  bit  per  second. 
It  is  assumed  that  blanks  are  of  no  interest  to  the  information 
sink  at  the  receiver.  This  source  produces  data  at  an  average 
rate  of  P  bits/t.  Thus  for  ^  <  1  there  will  be  times  when  the 
source  has  nothing  to  transmit,  and  this  condition  of  the 
source  must  be  conveyed  over  the  channel  by  means  of  over¬ 
head  bits  transmitted  along  with  information  bits.  As  we  shall 
MC.  the  number  of  these  overhead  bits  that  are  required  is  a 
function  of  the  allowable  delay. 

An  example  of  the  relationship  between  delay  and  the  over¬ 
head  information  it  given  by  the  following  encoding  proce¬ 
dure.  All  of  the  source  outputs  in  an  /  second  interval  are  en¬ 
coded  in  a  block.  In  each  /  second  interval  the  information 
that  is  transmitted  is  the  number  of  data  bits  generated  by  the 
source  and  llieii  values.  If  more  than  /  bits  are  required  the 
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Fig.  I.  Source  encoding  model. 


excess  is  buffered.  If  tliis  requires  less  than  /  bits,  zeros  can  be 
transmitted  in  the  slack  interval  remaining  aftei  the  buffered 
bits  with  no  ambiguity.  Since  the  position  of  blanks  in  an  / 
second  interval  is  of  no  interest,  the  number  of  actual  data 
bits  produced  by  a  source  in  an  /  second  interval  follows  a 
binomial  distribution.  The  entropy  of  the  source  over  an  I 
second  interval  is 


I 


H(l.  P)^Pl-X  *(«:  n  log,  b(«;  /.  P) 
r-o 


where 


b(r.l,P)  = 


P‘U  -P)'-'. 


(2) 


In  Fig.  2  we  show  plots  of  //(/,  F)//  as  a  function  of  /  for 
various  values  of  P.  In  fact,  it  can  be  shown  that 


lim 


I 


indicating  that  the  portion  of  capacity  devoted  to  overhead 
becomes  zero  as  the  block  size  is  increased.  There  is  a  price  to 
be  paid  in  performance  since  bit  delay  increases  as  the  block 
length  increases.  Thus,  by  increasing  /,  the  block  length,  and 
by  holding  P  fixed,  the  source  can  be  matched  to  the  channel 
in  the  sense  of  H(l,  P)ll  <  1  (see  Fig.  2).  On  the  other  hand, 
we  can  show  that  for  I  fixed,  //(/,  P)/l  >  1  as P approaches  1 . 
Define  1  -  P.  For  «  smaU,  //(/,  1  -  «)  a  (1  -  e)/  -  0  - 

/e)  Iog2(l  -  fe)  -  l«  logj/e.  Setting  H(l,  1  -  e)  >  1  we  find 
that 


-<I  -/e)logj  (1  -/«)>/e(l  +  log.  It).  (3) 

For  0  <  /e  <  RHS  of  (3)  is  negative.  Since  the  LHS 
of  (3)  is  positive  for  0</f  <  1,  the  assertioi}  is  proved. 

The  foregoing  illustrates  the  crux  of  the  problem  of  encod¬ 
ing  of  bursty  sources;  the  amount  of  overhead  required  is  a 
function  of  the  block  length  which  translates  directly  into 
delay.  The  same  phenomena  can  be  seen  operating  for  more 
complex  source  models.  As  an  e.xample  consider  the  case  of 
multiplexing  several  distinct  sources  on  the  same  channel.  If 
the  allowable  delay  is  large  enough,  the  overhead  required  to 
distinguish  the  source  can  be  made  negligible.  For  examine, 
data  from  each  of  the  sources  could  be  collected  in  separate 
buffers.  The  contents  of  these  buffers  could  then  be  time 
division  multiplexed  onto  the  line.  In  this  way  no  overhead  it 
required  to  indicate  the  sources  of  data.  This  procedure 
breaks  down  when  one  or  more  of  the  source  buffers  is  empty. 
However,  by  making  delay  large  enough,  the  occurrence  of  this 
event  will  have  little  effect.^ 


ADAPTIVE  ENCODING 

The  block  encoding  technique  discussed  above  can  be  Im¬ 
proved  upon  by  adding  a  measure  of  adaptivity  to  the  process. 
Assume  that  the  encoding  begins  in  the  same  way  as  in  the  pre¬ 
vious  section  by  using  a  Huffman  code  to  transmit  the  number 
of  data  bits  that  the  source  produces  in  an  /i  second  interval. 
The  probabilities  of  the  number  of  data  bits  are  a  function  of 
/|  u  well  u  P  [sec  (2)} .  Thus  the  Huffman  code  is  in  turn  a 
function  of  .  It  is  presumed  that  the  information  sink  knows 
/i  and  P  and  therefore  knows  the  Huffman  code  that  is  to  be 
used.  Let  us  assume  that  the  transmission  of  the  code  word 
together  with  the  actual  data  bits  consumes  /]  seconds.  The 
source  output  over  the  /j  second  interval  is  encoded  in  the 
same  fashion  as  previowsty.  Since  in  general  /,  is  not  equal  to 
/] ,  the  Huffman  code  used  to  encode  the  number  of  data  bits 
is  different.  However,  the  source  and  sink  can  cooperate  in 
the  construction  of  a  code  since  (2  is  known  to  both.  The 
process  repeats  for  each  successive  interval.  This  adaptive 
scheme  is  at  least  as  good  as  the  fixed  block  scheme  since 
encoding  begins  immediately  after  the  last  data  bit  of  a  block 
has  been  transmitted.  The  only  overhead  is  the  Huffman  code¬ 
word  for  the  number  of  bits.  In  the  fixed  block  scheme  mes- 
uges  can  be  delayed  while  zeros  are  filling  out  a  block. 

The  successive  coding  intervals  /] .  in  this  scheme  form 
a  Markov  chain.  Since  there  are  a  large  number  of  states,  find¬ 
ing  the  state  probabilities  is  a  formidable  numerical  problem. 
For  our  purposes  a  Monte  Carlo  simulation  of  a  somewhat 
simplified  model  of  the  scheme  is  adequate.  The  simplification 
comes  about  by  upper  and  lower  bounding  the  Huffman  code¬ 
word  by  quantities  that  are  much  easier  to  compute.  The  un¬ 
certainty  of  a  symbol  with  probability  P,  is  -logi  Pt-  Let 
|-log2  P/]  ~  denote  the  integer  part  of  -logj  P/.  If  a  code  with 
lengths  [-logj  F(]~  =  0,  1,  -.1  existed,  its  average  length 

would  lower  bound  the  entropy,  which  in  turn  lower  bounds 
the  average  length  of  a  Huffman  code.  A  code  with  lengths 
|-log2F,]~'f  1  satisfies  the  Kraft  inequality;  consequently, 
a  code  with  these  lengths  can  be  constructed.  However,  such 
a  code  must  have  average  length  greater  than  the  appropriate 
Huffman  code.  Tlie  results  of  the  simulation  are  shown  in  Fig. 
3  where  the  averages  of  the  upper  and  lower  bounds  of  the 
block  lengths,  .'1 .  /a .  '  arc  plotted  as  a  function  of  logj(|/l  - 


7  For  a  treatment  ol  encoding  of  multiple  sources  see  (6| . 
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Fi|.  3.  PerformanCfe  of  adaptive  block  encoding  tehcme. 

F).  The  results  indicate  that  at  P  approaches  one,  the  channel 
is  matched  by  encoding  over  longer  blocks  which  impliet 
larger  delay.  This  result  is  consonant  with  the  results  shown  in 
Fig.  2. 

The  results  of  Figs.  2  and  3  alto  suggest  that  for  a  given 
value  of  P,  the  average  length  of  a  block  it  given  by  block 
/  =>  H(l,  P).  It  can  be  shown  that  there  exists  an  I*  such  that 
^(/j//| )  <  /i  if  /|  >  /•  *nd  £l(/j//,)  >  /,  if  /,  <  I*.  In  Fig.  3 
the  locus  of  such  points,  i.e.,  HQ*,  P)  =  /•,  is  shown.  We 
notice  in  Fig.  3  that  as  P  increases,  this  equilibrium  point 
occurs  for  larger  and  larger  values  of  /,  indicating  larger  and 
larger  coding  delay.  In  fact,  it  can  be  shown  that  the  solution 
to  the  equation  is  approximated  by 

/•  a  2.78/(l  -F)  (4) 

for  P  close  to  one.  Consider  the  entropy  of  a  binomial  distri* 
bution  with  parameters  P  and  /.  The  maximum  probability  ig 
approximately  equal  to  (2n/tl  implying  that  the 

enUopy  is  greater  than  or  equal  to  -J-  log3(2irfXl-F)0.Now,a 
theorem  attributed  to  J.  M.  Massey  states  that  the  entropy  of 
a  discrete  random  variable  with  variance  is  less  than  or 
equal  to  Iog2(2ire(o^  +  1/12)).  Since  the  variance  of  the 
binomial  distribution  is  /XI  -  PV^  we  may  approximate  the 
entropy  by  log2(2jre/Xl  -  fV)-  The  solution  to  the  equation 
-J  log22ire/Xl  -  /V*  +  PI*  =  I*  is  given  by  (4)  foreclose  to 
one. 


FLAGS 

An  alternative  to  block  encoding  is  a  teclinique  employing 
Hags.  Whenever  the  transmitter  has  notliing  to  send,  this  state 
is  indicated  by  means  of  a  unique  /'-bit  sequence,  the  flag. 
During  the  time  interval  required  to  transmit  Uiis  flag,  binary 
digits  produced  by  the  source  are  buffered  and  read  out  after 
the  entire  flag  has  been  transmitted.  The  output  of  tlie  source 
is  transmitted  without  encoding  except  when  the  random 
source  replicates  the  first  F  -  J  bits  of  flag.  In  this  event  the 
transmitter  siuffs  a  bit  which  is  the  lomplemcnt  of  the  last 
bit  of  the  flag.  Thus,  the  only  time  a  flag  is  transm.;ied  it 
when  the  buffer  is  empty  and  n  blank  appears  at  the  output 
of  the  source. 


At  we  shall  tee  presently,  for  each  value  of  F  there  is  a  flag 
length  which  minimizes  bit  delay.  Tliis  minimum  strikes  a 
balance  between  flag  bits  and  stuffed  bits.  As  P  decreases  the 
buffer  is  more  likely  to  be  empty;  therefore,  the  flag  length 
should  be  decreased.  However,  decreasing  the  flag  length  in¬ 
creases  the  occurrence  of  stuffed  bits. 

The  frequency  of  occurrence  of  stuffed  bits  in  a  random 
data  sequence  can  be  analyzed  by  means  of  the  theory  of  re¬ 
current  events.  The  buffer  occupancy  can  be  analyzed  by 
means  of  a  Markov  chain  model.  Included  in  both  of  these 
analyses  are  the  parameters  P,  the  probability  the  source  pro¬ 
duces  a  binary  digit,  and  F,  the  length  of  the  flag.  It  can  be 
shown  that  the  results  also  depend  upon  the  particular  bit  pat¬ 
tern  in  the  flag. 

The  structure  of  flags  was  first  studied  by  Nielsen  [7] ,  who 
defined  the  concept  of  biflx  in  connection  with  flags.  A  flag 
is  biflx  free  if  there  is  no  sequence  which  is  a  preflx  and  a 
sufflx  to  the  first  F  —  1  bits  of  the  flag.  The  sequences  10000 
and  00001  are  examples,  respectively,  of  biflx  free  and  non¬ 
bifix  free  flap.  It  is  a  simple  exercise  to  demmistrate  that  non¬ 
bifix  free  flap  can  cause  delays  in  the  decoding  of  data  bits. 
While  this  is  no  problem  in  terms  of  operation,  the  analysis 
is  complicated.  We  shall,  therefore,  concentrate  on  the  biflx 
free  flag  100  ■**  0  in  the  sequel. 

The  theory  of  success  runs  as  treated  by  Feller  (8)  allows 
one  to  find  the  generating  function  of  the  time  between 
occurrence  of  stuffed  bits  in  a  stream  of  random  zeros  and 
ones.  For  the  moment  we  consider  successive  source  outputs, 
ignoring  intervening  blanks.  Consider  a  flag  consisting  of  a  one 
followed  by  F  -  1  zeros.  If  the  source  produces  a  one  fol¬ 
lowed  by  F  ~  2  consecutive  zeros,  immediately  a  one  is 
stuffed.  This  is  a  renewal  point  since  the  system  erases  all 
memory  and  begins  afresh.  Let  f//  be  the  probability  of  a 
stuffed  bit  after  the  Ah  data  bit.  The  probability  of  a  random 
data  sequence  producing  a  one  followed  by  F  -  2  zeros  is 
2~^*‘ .  However,  a  stuff  bit  can  only  occur  after  F-2  zeros. 
We  have 

U,^2-^*\  (5) 

The  generating  function  of  the  recurrence  interval  is  then 


sjsirf-' 

1  -S +  5(5/2)^-' 


The  mean  recurrence  interval  is 


M-2^-. 


(6) 


(7) 


In  (7)  the  recurrence  interval  is  in  terms  of  successive  zero 
and  one  outputs  of  the  source.  However,  between  these  out¬ 
puts  are  a  geometrically  distributed  number  of  blanks.  If  these 
blanks  are  taken  into  account,  the  average  time  interval  in 
seconds  is  given  by  p/A. 

In  connection  with  the  frequency  of  occurrence  of  stutT 
bits,  the  question  of  stability  arises.  For  the  bursty  model 
the  moan  time  between  blanks  is  1/(1  -  A).  In  order  that  the 
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transmit  biifrer  not  overflow,  it  is  necessary  that  the  mean 
time  between  blanks  be  less  than  tiic  mean  time  beiiveen 
stuffs.  For  (8)  we  have  the  criterion 


ln(/>/{l  -/>)) 

r  ^  "  ■ 

In  2 


+  1 . 


(8) 


The  overhead  incurred  by  the  use  of  flags  other  than  that 
treated  above  also  follows  directly  from  an  analysis  of  success 
runs.  Furthermore,  the  same  analysis  of  the  frequency  bit 
stuffmg  can  be  carried  out  for  the  simultaneous  use  of  several 
different  flags. 

Camrass  and  Gallager  [9]  have  studied  source  encoduig 
by  means  of  flags.  There  is  a  similarity  to  our  work  in  that  we 
both  use  the  theory  of  succen  runs  to  study  the  frequency  of 
occurrence  of  stuff  bits.  However,  the  difference  lies  in  the  use 
to  which  the  flags  are  put.  Camrass  and  Gallager  append  a 
flag  to  a  message  which  happens  to  have  a  geometric  distri¬ 
bution.  In  our  model  the  burst  of  consecutive  binary  digits 
is  geometrically  distributed,  but  we  do  not  delineate  differ¬ 
ent  bursts.  A  flag  is  transmitted  only  if  there  are  no  source 
output  bits  available. 

BIT  ENCODING  DELAY 

The  encoding  delay  of  a  bit  in  the  flag  scheme  can  be 
studied  by  means  of  a  two-dimensional  Markov  chain  model. 
The  chain  is  in  state  (/,/)  if  there  are  i  bits  in  the  buffer  and  if 
j  of  the  immediate  preceding  successive  bits  in  the  data  se¬ 
quence  coincide  with  the  first  j  bits  of  the  flag.  The  last  com¬ 
ponent  is  the  memory  of  the  system  and  has  dimension  F -2. 
The  mechanism  for  state  transmission  may  be  described  by 
considering  specific  cases.  If  the  buffer  is  not  empty,  its  con¬ 
tent  decreases  when  the  data  source  produces  a  blank.  If  the 
buffer  is  empty  when  a  blank  is  produced,  then  the  first  bit 
of  the  flag  is  transmitted  and  the  remaining  F  -  I  bits  are 
inserted  into  the  transmit  buffer.  The  level  of  the  buffer  can 
also  increase  due  to  a  stuff  bit.  We  point  out  that  the  stuffing 
of  a  bit  or  the  transmission  of  a  flag  brings  the  system  to  the 
zero  memory  state. 

Let  us  consider  the  flag  100  ■■■  0,  a  one  followed  by  F  —  1 
zeros.  The  state  transition  diagram  for  this  case  is  shown  on 
Fig.  4.  Let  qij-,  i  =  0,  1 , 2,  — ;  /  =  0,  1 ,  — ,  F  -2  denote  the 
steady  state  probability  that  the  system  is  in  state  (/,/).  We 
can  write  a  set  of  equilibrium  equations  by  considering  par¬ 
ticular  sets  of  states.  Consider  the  states  such  that  i  >  0, 
2  </ <  F -  2;  we  have 


SIATE  Of  kCMORf 

!  A'  '  ‘  '  ] 

u.  ■  I  •  .  •  — U 


Fig.  4.  Markov  chain  model  of  flag  scheme, 
turn  now  to  the  case  /  #  0,  F  -  I ,/  »  Q.  We  have 

« to  =  (I  -  /*)<?!♦  1 .0  +  (/*/2)<Ito  +  (F/2)t7/- ,  .f  _  j .  (11) 

For  I  >  0  and  /  1  we  have 

«ti  »(/’/2)ai  +  (i (12) 

where  qi  &  i  >  0  is  the  probability  that  the 

buffer  size  is  i.  The  final  case  to  be  considered  is  i  =  F  -  1 , 

/-O. 

<tF~  1 ,0  *  (1  *  (F/2)q^ _  i  ,o 

+  (^/2)«tr- (I -/Offo-  (13) 

Define  the  z-transform  of  the  buffer  occupancy  probabil¬ 
ities  at  a  particular  memory  state  as 

m 

Q/(*)  ^  2  *^<7/./;  /  *  0.  1 ,  •••,  F  -  2  (1 4a) 

t“0 


and  the  buffer  occupancy  moment  generating  function  as 


F-2 

«*)-  2  w 

/■o 


(14b) 


Employing  (9)-(13)  together  with  the  deflnitions  in  (14)  it 
can  be  shown,  after  some  manipulation,  that 


qq  =  (P/2)q,j_i+(l-F)q,^tj.  (9) 

The  state  (r,  /)  is  entered  by  the  arrival  of  a  bit  replicating  a 
flag  bit  with  probability  F/2  or  the  arrival  of  a  blank  with 
probability  1  -  P.  For  i  =  0  and  /'  s*  0  we  have 


9oo  =  (l -/*)dio  +  (/’/2)9oo  (10) 

Notice  that  the  zero  memory  state  implies  that  a  flag  or  a 
stuff  bit  has  been  transmitted.  The  system  remains  in  the  zero 
memory  slate  if  a  zero  is  (he  output  of  the  data  source.  We 


•(i-l+Pf 


V-1)  (1- 


/*)<7oi 


-t-d  -/>Xr--l  +/y'"*(z^-l)</oo 


1) 


{(\-F)(z~l+Ff-^-(P2/2f-*].  (15) 


In  (15)  there  are  F  -  I  unknowns  for  uo,;  /  =  0,  1,  •••, 
F  -  2.  In  order  to  solve  for  these  unknowns  we  utilize  a  tech¬ 
nique  employing  Rouche's  theorem  {10},  |ll|.  Q{:)  is  ana¬ 
lytic  within  the  unit  disk;  consequently,  when  the  denomina- 
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tor  has  a  zero  within  the  unit  disk,  the  numerator  must  have  a 
zero  at  the  same  point.  From  the  stability  condition  of  (8)  it 
can  be  shown  that  the  polynomial  in  the  denominator  of  (IS) 
has  -  2  distinct  roots  on  the  unit  disk.  Let  us  denote  these 
roots  as  fl,-;  i  =  1,2,  F  -  2.  Since  the  denominator  of 
Q(:)  must  be  equal  to  zero  at  these  points,  we  have  then  from 
( 1 5 )  the  following  set  of  equations. 


(«,-!+  F)'- ' 


FLAG  LENGTH 


Fig.  S.  Performance  of  flag  ichetnc. 


-(fl.--l  +  f)^-2(e,^-l)J(l  -P)qoi 
+  O-PX0i-i+Pf-Hei''-i)qo<. 

=  0.  1=  1,2,  •.F-2.  (16a) 

We  also  have  the  normalizing  condition 

e{i)=i.  (16b) 

Equations  (16a)  and  (16b)  give  F  —  1  equations  in  /*  —  1 
unknowns. 

Finding  the  roots  of  the  polynomial  P(z)  and  solving  (16) 
for  the  quantities  Qqi,  1  =  0,1,  —,  F"  -  2  are  relatively  straight¬ 
forward  numerical  problems.  We  can  then  calculate  Q(z), 
the  generating  function  of  the  transmit  buffer  occupancy. 
Returning  to  the  equilibrium  equations  (9)-(13),  we  see  that 
the  probabilities  q^j,  i  =  0, 1,2,  ";/  =  0,1,  •,F -  2  can  also 
be  found.  These  wiU  be  used  to  calculate  decoding  delay  in  the 
sequel. 

We  use  as  a  measure  of  performance  the  average  delay 
suffered  by  a  bit  as  determined  by  the  average  number  of  bits 
in  the  buffer.  The  average  number  of  bits  in  the  buffer  can  be 
determined  from  Q(z).  The  results  are  shown  in  Fig.  5  where 
q,  the  average  delay,  is  shown  as  a  function  of  flag  length  with 
1  -  P,  the  probability  of  a  blank,  as  a  parameter. 

As  shown  in  Fig.  5,  for  large  values  of  F,  the  average  delay 
may  be  approximated  by  (F  -  l)/2  for  all  values  of  P.  For 
large  values  of  F,  the  buffer  occupancy  jumps  to  F'  -  1  when  a 
flag  is  sent.  It  more  or  less  linearly  (on  the  average)  returns 
to  zero  as  blanks  predominate  over  stuffed  bits.  When  it 
reaches  zero  a  flag  is  sent  again.  As  the  flag  length  is  decreased, 
the  stuff  bits  come  into  play.  (Recall  that  from  (7)  the  mini¬ 
mum  value  of  F  is  determined  by  considering  stability  in 
connection  with  stuffed  bits.)  Although  it  is  nut  evident 
from  the  cinves  shown  in  Fig.  5  for  large  P,  there  is  a  mini¬ 
mum  value  of  average  delay  at  a  value  of  F  larger  than  the 
minimum.  Iliis  slight  dip  in  the  curves  is  shown  by  the  figures 
in  Table  I. 

As  P  approaches  one,  the  behavior  of  tlie  flag  encoding 
technique  is  similar  to  the  block  encoding  technique  de¬ 
scribed  in  the  above.  As  is  illustrated  on  F'lg.  5,  the  delay  for 
any  value  of  P  may  be  lower  bounded  by  (F  -  1  )/2.  However, 
from  (10)  we  have  that  F  >  log2(F/l  -  F)  +  1,  and  as  P 
approaches  one,  average  delay  becomes  infinite. 


TABLE  I 


i  - 

7 

8 

9 

10 

11 

12 

2-S 

3.82 

3.78 

4.12 

S.03 

5.51 

2-6 

4.40 

4.30 

4.63 

5.53 

2-’ 

4.94 

4.82 

5.13 

5.56 

2-* 

5.47 

5.32 

5.64 

COMPARISON  OF  BLOCK  AND  FLAG  ENCODING 

A  comparison  of  Figs.  3  and  S  shows  a  large  difference  in 
encoding  delay  in  favor  of  the  flag  scheme.  For  example,  for 
F  =  1  -  2“^  the  average  block  length  is  approximately  40 
bits.  We  take  this  to  be  an  approximation  to  the  number  of 
bit  intervals  a  newly  arriving  bit  must  wait,  i.e.,  40  s.  In  con¬ 
trast,  the  encoding  delay  for  a  bit  in  the  flag  technique  can  be 
less  than  4  s  for  the  appropriate  flag  length.  These  results 
apply  generally  to  other  probabilities.  Further,  as  F  -♦  1 . 
delay  in  the  block  scheme  is  proportional  to  1/(1  -  F),  whereas 
in  the  flag  scheme  it  is  proportional  to  log  1/(1  -  F). 

For  the  flag  encoding  scheme,  encoding  delay  is  not  the 
only  delay  encountered  before  a  bit  is  delivered  to  an  informa¬ 
tion  sink  at  the  receiver.  When  a  bit  is  received  there  may  be 
an  ambiguity  that  must  be  resolved.  Moreover,  this  decoding 
delay  is  random  and  depends  upon  the  state  of  the  system, 
begin  the  analysis  by  assuming  the  state  (/,  /)  for  the  flag 
100  —  0.  Recall  that  i  is  the  number  of  data  bits  in  the  trans¬ 
mitting  buffer  and  /  is  the  system  memory  with  respect  to  the 
flag.  The  ambiguity  is  resolved  immediately  if  the  received  bit 
does  not  replicate  the  flag,  i.e.,  a  zero  for  state  j  =  0  and  a  one 
otherwise.  The  ambiguity  persists  if  the  bits  generated  by  the 
source  continue  to  replicate  the  flag,  and  if  the  source  pro¬ 
duces  I  or  fewer  blanks.  To  justify  this  second  requirement  we 
note  that  more  than  i  blanks  would  deplete  the  buffer  and  a 
flag  that  would  be  transmitted  resolves  the  ambiguity.  The 
ambiguity  can  persist  for  at  most  F  -  j  -  \  seconds,  since  the 
first  F  -  1  bits  of  the  flag  would  be  replicated  and  a  stuffed 
bit  would  be  transmitted,  also  resolving  the  ambiguity.  We 
summarize  this  in  the  following  equations  where  the  random 
variable  D  denotes  decoding  delay. 

P,\D>dl{i,i)] 

'  ,  /d  -F  /  -  l\ 

=  (1/:H/’/2)‘'2(1 -F)'(  )  (17) 

t^o  \  I  / 

Lqu.ilion  (17)  is  the  probability  of  a  sequence  replicating  the 
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flag  bits  interspersed  witli  ;  or  fewer  blanks.  Based  on  (17)  we 
have  for  the  probability  of  delay 

Pr[D  =  0liJ]  =1/2  for  all  I,/ 


Pr[D  =  l/f./l 


Pr[D  =  d/iJ] 


1/2;  i  =  F-2 

1/2 >1/0.7); 

7  =  0.  1.-.F-3 


0; 

Pr[D>d-ll(i,i)] 

P,[D>d-lKi,n] 

-Pr[D>d/{i,f)] 


d>F-i 
d  =  F-i-l 

Q<d<F-i-2. 


The  average  delay  can  be  found  by  averaging  over  this  condi¬ 
tional  distribution,  then  averaging  over  the  state  (/.  >)  whose 
distribution  was  obtained  in  the  previous  analysis.  The  results 
are  shown  in  Fig.  6  where  delay  is  plotted  as  a  function  of 
flag  length  with  1  -  Fas  a  parameter.  We  see  that  the  decoding 
delay  is  not  large  and  that  it  is  not  very  sensitive  to  F  and 
1  -  F.  We  conclude  then  that  decoding  delay  has  little  effect 
upon  the  flag  encoding  scheme  as  related  to  the  other  encod¬ 
ing  techniques  we  have  examined. 

CONCLUSION  AND  FUTURE  WORK 

A  simple  source  encoding  problem  which  illustrates  the 
tradeoff  between  delay  and  efficient  encoding  has  been  posed. 
Two  encoding  techniques,  one  using  Huffman  codes  and  one 
using  flags,  have  been  examined  in  detail.  For  the  second  of 
these,  stability  conditions  were  found  and  a  Markov  chain 
model  was  analyzed.  In  both  cases  it  was  found  that  asF-*^  1 
the  delay  approaches  inflnity.  The  paradox  in  this  result  is 
that  for  F  =  1  the  coding  problem  disappears.  A  secondary 
result  is  the  superiority  of  the  flag  encoding  scheme  over 
Huffman  coding  when  ^eiay  is  used  to  measure  performance. 
This  suggests  a  kind  of  compatibility  between  delay  and  flags. 

We  have  also  found  other  bounds.  All  lower  bounds  found 
tend  to  a  finite  value  zs  P  -*  1 ,  while  for  all  upper  bounds 
found  the  delay  becomes  unbounded.  Recently  Roskind  (12 j 
has  found  a  strategy  for  which  the  delay  is  proportional  to 
log  (log  1/(1  -F)l  asF-»  1. 

The  work  can  be  continued  in  several  directions.  The  most 
compelling  of  these  is  the  study  of  a  lower  bound  for  delay 
over  all  possible  codings  especially  when  F  is  close  to  one. 
Also,  the  source  model  can  be  extended  by  considering  more 
general  arrival  processes,  and  by  considering  several  sources 
which  must  remain  distinct,  multiplexing  can  be  addressed. 
Again  it  appears  that  a  Markov  chain  model  can  be  used  to 
describe  a  flag  encoding  scheme. 
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Fig.  6.  Average  decoding  delay  versus  flag  length. 
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ABSTRACT 


A  model  of  a  bursty  source  whose  output  Is 
transmitted  over  a  synchronous  channel  is  studied 
with  the  objective  of  understanding  the  relation¬ 
ship  between  coding  efficiency  and  delay.  Two  en¬ 
coding  schemes,  a  block  encoding  technique  and  a 
technique  employing  flags,  are  examined  in  some 
detail.  The  flag  technique  is  analyzed  by  means  of 
Che  theory  of  recurrent  events  and  by  two  dimen¬ 
sional  Markov  chains.  For  both  the  block  encoding 
and  the  flag  schemes  it  is  shown  that  as  the  source 
becomes  less  bursty  delay  approaches  infinity. 

This  results  obtains  in  spite  of  the  fact  that  both 
schemes  are  reasonable  and  in  the  limit  the  encoding 
problem  disappears. 


INTRODUCTION 


This  paper  is  concerned  with  the  interplay  of 
delay  and  overhead  in  Che  multiplexing  of  data  on 
common  facilities.  The  relationship  of  these  quan¬ 
tities  Is  due  to  certain  unique  properties  of  data 
In  an  Interactive  environment.  A  basic  property 
of  data  traffic  is  burstiness,  l.e.,  short  busy 
periods  Interspersed  with  relatively  long  Idle 
periods  which  contain  no  information.  The  second 
relevant  property  of  data  has  to  do  with  delay.  In 
many  applications  data  traffic  is  highly  inter¬ 
active  placing  a  premium  on  rapid  response  times. 
These  properties  of  data  signals  allow  one  to  for¬ 
mulate  3  source  encoding  problem  which  considers 
delay  as  a  basic  component.* 

SOURCE  MODEL 


In  order  to  study  the  problem  of  delay  In  cod¬ 
ing  we  consider  the  following  model.  A  source  pro¬ 
duces  either  a  blank  or  binary  digit  once  every 
second  with  the  following  probabilities. 


*  Related  work  on  the  encoding  of  bur.sty  sources 
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has  bee. I  carried  out  by  Callagcr  and  Humblec  . 


This  work  was  supported  In  part  bv  NSERC  Grant 
A0901  and  by  the  postdoc tor.j  1  program  xADC  B9-3513 
funded  by  US  Army  COPADCCM. 


P  (blank)  -  1  -  P 
P(0  output]  ■  P/2 
P[1  output]  •  P/2 


(1) 


The  sequence  of  O's  and  I's  produced  by  this  source 
is  to  be  carried  over  a  synchronous ,  error-free  bi¬ 
nary  channel  with  a  capacity  of  one  bit  per  sereni. 
This  source  produces  information  at  an  average  r..tf 
of  P  bits/second.  Thus  for  Pel  there  will  be 
times  when  the  source  has  nothing  to  transmit.  But 
the  channel,  being  synchronous,  transmits  only  a 
zero  or  a  one  every  second  and  this  condition  must 
be  suitably  encoded.  In  general  the  number  of  over¬ 
head  bits  that  are  required  to  encode  the  state  of 
the  source  is  a  function  of  the  allow.sble  .lelav. 

The  relationship  between  del.av  and  ch.ir.ncl 
capacity  can  be  Illustrated  by  means  of  the  foll.-w- 
Ing  encoding  procedure.  All  of  the  source  outr'.'.  ■ 
in  an  1  second  interval  are  encoded  in  a  blocK. 
each  I  bit  Interval  the  information  that  mu.st  no 
transmitted  is  the  number  of  binary  digits  gener¬ 
ated  by  the  source  and  the  values  of  these  digits. 
If  this  requires  less  than  t  bits  intervals,  zeros 
can  be  transmitted  In  the  slack  interval  with  no 
ambiguity.  Since  the  number  of  data  bits  produced 
In  an  t  bit  interval  follows  a  binomial  distribu¬ 
tion,  the  entropy  of  this  encoding  is 


H(t,P)  -  PI  -  I  b(l:l,P)log  b(i:t,P) 


where 


b( 


1-0 

1:1, P)  -\i)p^  (1-P) 


1-1 


(2) 


In  Figure  1  we  show  plots  of  H(t,P)/t  as  a  function 
of  1  for  various  values  of  P.  We  see  that  H(l,P)/i 
asymtotlcally  approaches  P  tor  large  1.  Thus,  th. 
portion  of  capacity  devoted  to  overhead  beo-i 
zero  as  the  block  size  Is  Increased.  Howe  c' 
delay  Increases  as  the  block  length  increiso.,. 
Increasing  1.  the  block  length,  and  by  hold  me  ? 
fixed  the  .source  can  he  marched  to  the  channel  m 
the  sense  of  K(i,P'//l  ^  1  (see  Fig.  1).  On  t  u 
other  hand  It  can  be  shown  that  tor  "  :ixej 
H(l,P)/l  >  1  as  P  apprc.tches  1.  Define  r  •.  1  -  ’ 

For  r  small  h(I,l-E)  =  ll-f)i  -  (1  - tc ) log,  ( 1  - 1 1  ' 

-  tElog.,tc.  But  for  H(l,l-el  ■>  I  we  have 

-(l-leJlcg^d-lE)  >  tE(l  +  log,lc)  if' 

For  C<lc<J  the  RHS  of  Eq .  3  is  neg.it  tve.  S  ;  n.  e  :  b. 
LHS  of  Eq .  3  .s  positive  for  OciL<l,  t^e  i 

Is  proved. 
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*# 


The  foregoing  Illustrates  the  crux  of  the  pro- 
blctn  of  encoding  of  bursty  sources;  the  amount  of 
c>verhi*.id  required  is  n  function  of  the  block  length 
which  translates  directly  Into  delay.  The  same 
phenomena  can  be  seen  operating  for  more  complex 

2 

source  models  involving  multiple  sources  . 


From  F.q .  4  one  can  find  the  generating  function  of 
the  recurrence  times. 


\-z*(zl2) 


We  can  calculate  Che  mean  recurrence  time  as 


Flags  can  be  used  to  encode  the  basic  bursty 
soiree  discussed  above.  The  encoding  process  Is 
initiated  by  cransmicclng  Che  digital  output  of  the 
source.  When  a  blank  appears  a  flag  composed  of  a 
unique  F-bit  sequence  is  transmitted.  During  the 
time  interval  required  to  transmit  this  flag  binary 
digits  produced  by  the  source  are  buffered  and  read 
out  after  the  entire  flag  has  been  transmitted.  The 
output  of  the  source  Is  transmitted  without  encod¬ 
ing  except  when  the  random  source  replicates  Che 
first  F  -  1  bits  of  flag.  In  this  event  the  trans¬ 
mitter  stuffs  a  bit  which  Is  the  complement  of  the 
last  bit  of  the  flag.  The  only  time  a  flag  la 
transmitted  is  when  Che  buffer  is  empty  and  a  blank 
appears  at  the  output  of  the  source.  For  each  value 
of  P  there  Is  a  flag  length  which  strikes  a  balance 
between  flag  bits  and  stuffed  bits.  As  P  decreases 
the  buffer  Is  more  likely  to  be  empty  therefore  the 
flag  length  should  be  decreased.  However  decreasing 
the  flag  length  Increases  the  occurrence  of  stuffed 
bits. 

The  frequency  of  occurrence  of  stuffed  bits  in 
a  random  data  sequence  can  be  analysed  by  means  of 
the  theory  of  recurrent  events.  The  buffer  occu¬ 
pancy  can  be  analyzed  by  means  of  a  Harkov  chain 
model.  Included  In  both  of  these  analyses  are  the 
parameters  P,  the  probability  the  source  produces  a 
binary  digit  and  F,  the  length  of  the  flag.  To  some 
extent,  the  results  also  depend  upon  the  particular 
bit  pattern  In  the  flag.  ^ 

The  theory  of  success  runs  as  treated  by  Feller"^ 
allows  one  to  find  the  generating  function  of  the 
time  between  occurrence  of  scuffed  bits  In  a  stream 
of  random  zeros  and  ones  (For  an  alternative  ap¬ 
proach  see  Ref.  A).  Consider  a  flag  consisting  of 
F  zeros.  If  the  source  produces  F  -  1  consecutive 
zeros  Imv.edlately  a  one  Is  stuffed.  This  Is  a  re¬ 
newal  point  since  Che  system  erases  all  memory  and 
begins  afresh  the  counting  o'  strings  of  zeros.  Let 
L'j  be  the  probability  of  a  stuffed  bit  after  the  Jth 

data  bit.  The  probability  of  the  source  producing  a 
sequence  of  F  -  1  consecutive  zeros  on  bit  numbers 

-F4-1 

j  ,  J  -  1 , . . . ,  J  -  F  +  2  Is  2  .  There  is  one  and 

only  one  stuff  bit  after  one  of  these  zeros.  The 
probability  that  the  stuff  Is  after  bit  number  j-k 

and  the  next  k  bits  are  zeros  Is  U  2  (k*0,l,.... 

J 

F  -  2|.  Since  F  -  1  possibilities  are  mutually  ex¬ 
clusive  we  have  the  following  recurrence  relation¬ 
ship 


J  1  F  -  1 


1  -  (1/2)'"'^  .  ,F 

F  z  -  i 

(1/2)' 


The  sensitivity  of  the  stuffing  process  to  the  part¬ 
icular  flag  pattern  is  illustrated  by  considering 
the  flag  100... 0,  a  one  followed  by  F  -  1  zeros. 

The  probability  of  a  random  data  sequence  producing 

F-1 

a  one  followed  by  F  -  2  zeros  Is  (1/2)  ;  however 

a  scuff  bit  can  only  occur  after  F  -  2  zeros.  We 
have 

The  generating  function  of  the  recurrence  times  Is 
then 

'z/2)^-^ 

F,(z) - - =r-r  (7) 

1  -  z  +  {zm 


The  mean  recurrence  time  Is 


V2-2' 


It  Is  of  Interest  to  note  that  from  Eqs.  6  and  8 


11m  “l 


Thus  in  the  limit  the  sequence  1000. . .0  evokes  twice 
as  many  stuff  bits  as  the  sequence  000... 0. 

In  connection  with  the  f requency  of  occurrence 
of  stuff  bits  Che  question  of  stability  arises.  For 
the  bursty  model  the  mean  time  between  blanks  is 
1/(1-P).  In  order  that  the  transmit  buffer  not 
overflow  it  is  necessary  that  Che  mean  time  between 
blanks  be  less  than  Che  mean  time  between  stuffed 
bits.  From  Eq .  6  we  can  derive  the  following 
stability  criterion  for  the  all  zero  sequence. 

^  ,  ln(2-F(l/l-P)) 


From  Eq .  8  we  have  the  criterion  for  the  sequence 

100.. .0 

,  ^  ln(l/(l-P)  ^  , 


Bit  delay  for  the  flag  scheme  can  be  studied 
by  means  of  a  Harkov  chain  model.  For  ease  of 
analysis  wc  only  cunslder  the  flag  100... 0.  The 
same  general  conclusions  apply  to  the  all  zero 
flag.  The  state  of  tl.e  chain,  is  two  dimen¬ 

sional  consistlry  of  the  number  of  bits  In  the  buf¬ 
fer,  i,  and  of  the  number  jf  bits  in  the  data 
sequence  that  replicate  the  flag,  J.  The  last 
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component  Is  the  memory  of  the  system  and  has  di¬ 
mension  F  -  1.  The  mechanism  for  state  transmis¬ 
sion  may  be  described  by  considering  spec  if  Ic  cases. 
If  the  buffer  Is  not  empty  its  contents  decrease 
when  the  data  source  produces  a  blank.  If  the  buf¬ 
fer  is  empty  when  a  blank  is  produced  then  the 
first  bit  of  the  flag  Is  transmitted  and  the  re¬ 
maining  F  -  1  bits  are  inserted  into  the  transmit 
buffer.  The  level  of  the  buffer  can  also  Increase 
due  to  a  stuff  bit.  We  point  out  that  the  stuffing 
of  a  bit  or  the  transmission  of  a  flag  brings  the 
system  to  the  zero  memory  state. 

The  state  transition  diagram  for  the  flag 
100.  .  .0  is  shown  In  Fig.  2.  Let  <l£jl  0,1,2...; 

J  •  0,1,...,  F  -  2  denote  the  steady  state  probabi¬ 
lity  that  the  system  is  in  state  (i,i).  We  can 
write  a  set  of  equilibrium  equations  by  considering 
particular  secs  of  states.  Consider  the  states  such 
that  1^0,  2£j^F-2we  have 

‘'ij  ■  ^ 

The  state  (i,J)  is  entered  by  the  arrival  of  a  bit 
replicating  a  flag  bit  with  probability  P/2  or  the 
arrival  of  a  blank  with  probability  1  -  P.  Now 
consider  the  case  where  a  string  of  zeros  is  inter¬ 
rupted  by  a  one.  We  have 

“00  ■  -  '>“10  * 

Notice  that  Che  zero  memory  state  implies  that  a 
flag  or  a  stuff  bit  has  been  transmitted.  The  sys¬ 
tem  remains  in  the  zero  memory  state  if  a  zero  is 
Che  output  of  the  data  source.  We  turn  now  to  the 
case  1  d  0,  F  -  1,  J  -  0.  We  have 

“lO  ■  ^^"'>“1+1,0  *  *  ^'^^>“i-l,F-2 

(13) 

For  12.®  j  2  1  '•« 


Employing  Eqs.  11-14  together  with  the  definitions 
in  Eq.  IS  it  can  be  shown,  after  some  manipulation, 
that 

-  I  l(z-l)  (^)  (z-l+P)‘'^ 

i-1 

-  (z-l+P)'"^  (z'  -  1)1  (l-P)qot 

4  (1-P)  (z-l.P)'-^z'  -  l)q„„  . 

Q(z)*  p_2  '  F-1 

(z-1)  Kl-P)  (z-l+P)  -  (Pz/2)  ) 

In  Eq .  16  there  are  F-1  unknowns  for  qgt; 

t  *  0,1,...,  F  -  2.  In  order  to  do  this  we  utilize 
a  technique  employing  Rouche's  theorem.  Q(z)  is 
analytic  within  the  unit  disk;  consequently,  when 
the  denominator  has  a  zero  within  the  unit  disk  the 
numerator  must  have  a  zero  at  the  same  point.  From 
the  stability  condition  of  Eq.  10  it  can  be  shown 
that  the  polynomial  in  the  denominator  of  Eq.  17 
has  F  -  2  distinct  roots  on  the  unit  disk.  Let  us 
denote  these  roots  as  6^;  1  •  1,2 . F  -  2. 

Since  the  denominator  of  Q(z)  must  be  equal  to  zero 
at  these  points,  we  have  then  from  Eq.  17  the  fol¬ 
lowing  set  of  equations. 

-  I  I(6i-1)  (8.-1+-')  ‘ 

I'l 

-  (e^-i^.ri'"^-(e^  -  i)l  d-Piq^^ 

+  (i-p)  (e^-i+P)'"^  (e[  -  Dq^j,  -  0, 

1  -  1,2,. ..,F  -  2  (17«) 

We  also  have  the  normalizing  condition 


“ll  ■  “l  *  “i+1.1 


Q(l)  -  1 


(17b> 


The  final  cases  to  be  considered  are  Chat  of 
i  -  F  -  1,  j  -  0. 

“f-1.0  -  <^-'>F,0  ■  ("'>“f-1.0 

+  (P/2)qp_2  J..2  +  d-P)qo  (15) 

where 

F-2 

“o  “  “o,j 

J-0  ’■* 

Define  the  z-transform  of  the  buffer  occupancy 
probabilities  as 

Q.(*)  4  I  *^q,  .  5  J  “  Od . F  -  2  (16a) 

^  1-0  ’J 


and 


F-2 

Q(z)  -  [  Q.(*) 

J-0  ^ 


(16b) 


Eqs.  17a  and  17b  give  F-1  equations  in  F  -  1  un¬ 
knowns. 

Finding  the  roots  of  the  polynomial  P(z)  and 
solving  Eq.  17  for  the  quantities  qqt,  1  -  0,1,..., 
F-2  are  relatively  straightforward  numerical  pro¬ 
blems.  We  can  then  calculate  Q(z)  the  generating 
function  of  the  transmit  buffer  occupancy.  Return¬ 
ing  to  the  equilibrium  Eqs.  11-15,  we  see  that  the 
probabilities  q  .,  1  -  0,1,2,...;  J  -  0,1,..., 

^  »  J 

F-2  can  also  be  found. 

We  use  as  a  measure  of  performance  the  aver¬ 
age  delay  suffered  by  a  bit  as  determined  by  the 
average  number  of  bits  in  the  buffer.  The  average 
number  of  bits  in  the  buffer  can  be  determined 
from  Eq.  16  by  differentiation  or  by  calculating 


-  F-2 

y  i  ^  q,,'  Fhr  results  are  shown  in  Fig.  3 

1-0  1-0 

where  q  average  delay  la  shown  as  a  function  of 
flag  length  with  1-P,  the  probability  of  a 
blank,  as  a  parameter. 
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As  shown  In  FiK-  3,  for  large  values  of  P, 

Che  average  delay  may  be  approximated  by  (F-l)/2 
for  all  values  of  P.  This  Is  as  expected  since 
for  large  values  of  F,  stuff  bits  have  little  ef¬ 
fect.  As  the  flag  length  is  decreased  the  stuff 
hits  come  into  play.  From  Eq.  10  the  minimum  val¬ 
ue  of  F  is  determined  by  considering  stability  In 
connection  with  stuffed  bits.  Although  it  la  not 
evident  from  the  curves  shown  on  Fig.  3,  for  large 
P  there  is  a  minimum  value  of  average  delay  at  a 
value  of  F  larger  than  the  minimum.  As P approach¬ 
es  one  the  behavior  of  the  flag  encoding  technique 
is  similar  to  the  block  encoding  technique  describ¬ 
ed  in  the  foregoing.  As  is  Illustrated  In  Fig.  3 
the  delay  for  any  value  of  P  may  be  lower  bounded 
by  (F-l)/2  by  Ignoring  the  effect  of  stuff  bits. 
However  from  Eq.  10  we  have  that  F  =  •»  .  Thus 

as  P  approaches  one,  average  delay  becomes  infinite. 
Eq.  9  indicates  that  we  may  expect  the  same  sort  of 
beliavlor  from  the  all  zero  sequence. 

CONCLUSION  AND  FUTURE  WOKK 


A  simple  source  encoding  problem  which  illus¬ 
trates  the  tradeoff  between  delay  and  efficient 
encoding  has  been  posed.  Two  encoding  techniques 
have  been  examined  In  detail.  For  the  second  of 
these,  utilizing  a  flag,  stability  conditions  were 
found  and  a  Markov  chain  siodel  was  analyzed.  In 
both  cases  It  was  known  that  as  P-»l  the  delay  ap¬ 
proaches  Infinity.  The  paradox  In  this  result  la 
that  for  P  -  1  the  coding  problem  disappears. 

The  work  can  be  continued  In  several  direct¬ 
ions.  First  of  all  the  minimum  delay  over  all  cod¬ 
ing  schemes  for  P  close  to  one  Is  of  Interest. 
Also,  the  source  model  can  be  extended  by  consider¬ 
ing  sore  general  arrival  processes,  (e.g.  Poisson) 
and  by  considering  several  sources  which  must  re¬ 
main  distinct.  Again  it  appears  that  a  Markov 
chain  model  can  be  used  to  describe  a  flag  encoding 
scheme.  Finally  the  subject  of  decoding  delay  for 
the  flag  scheme  merits  attention.  When  a  bit  forma 
part  of  a  flag  sequence  there  Is  an  ambiguity  at 
the  receiver.  The  time  required  to  resolve  this 
ambiguity  Is  a  function  of  the  state  of  the  system. 
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Abstract 


We  consider  the  problem  of  obtaining  an  optimal  strategy  for 
packetizing  a  data  stream  which  enters  a  network  one  character  at  a 
time.  Our  objective  is  to  maximize  the  number  of  users  subject  to  a 
delay  constraint  which  includes  both  queuing  and  packetization  delay. 
We  show  that  the  optimal  strategy  is  to  packetize  with  probability  (1-q) 
after  the  arrival  of  N  characters  and  to  otherwise  packetize  at  the 
arrival  of  the  N+l'st  character  where  N  and  q  are  functions  of  the 
delay  constraint.  A  proof  of  optimality  and  numerical  comparisons  with 
other  packetization  strategies  are  given. 


This  work  was  supported  in  part  by  Grant  80-80-K-0579  from  the  U.S. 
Army  CORADCOM.  This  work  is  a  result  of  Chan  D.  Tsao's  Ph.D. 
Thesis. 
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I.  Introduction 

Most  packet  switched  networks  are  characterized  by  the  "bursty" 
nature  of  the  traffic  that  they  carry,  i.e.,  by  the  fact  that  the  traffic 
sources  deliver  data  to  the  network  intermittently  with  a  high  peak-to- 
average  rate.  It  is  this  phenomenon  that  makes  packet  switching, 
which  gives  the  communication  channel  to  a  user  only  when  he  actually 
wants  to  use  it,  so  attractive.  Ironically,  however,  in  the  most 
extreme  case  of  burstiness,  when  the  sources  generate  data  one 
character  at  a  time,  packet  switching  may  become  unacceptable  because 
of  the  overhead  due  to  the  header  which  must  be  placed  on  each 
packet.  Thus,  if  an  H  character  header  is  used  with  only  one  data 
character  per  packet,  an  overhead  of  100H%  is  incurred.  Typical 
values  of  H  range  between  6  and  32  in  networks  in  operation  today.  If 
a  random  access  mechanism,  where  users  transmit  at  random  times  and 
contend  for  the  channel  is  used,  a  significant  number  of  additional 
synchronization  bits  must  be  added  to  the  header  and  H  may  become  as 
large  as  60  in  some  cases.  As  random  access  becomes  more  prevalent, 
as  it  is  in  local  area  networks  today,  this  problem  becomes  more  severe. 

An  obvious  solution  to  this  problem  is  to  buffer  input  characters 
until  a  reasonable  number  of  them  can  be  placed  in  a  packet  together. 
If  N  data  characters  are  placed  in  a  packet  with  an  H  character  header, 
the  overhead  is  reduced  by  a  factor  of  N  when  compared  with  the  case 
where  only  a  single  data  character  is  placed  in  each  packet.  Now, 
however,  one  must  also  consider  the  effect  of  packetization  delay.  If 
we  wait  for  N  characters  to  arrive  before  packetizing ,  we  incur  a 
packetization  delay  roughly  linear  with  N.  If  the  input  channel  or 
the  source  is  low  speed  and  there  is  a  tight  constraint  on  delay,  this 
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FIGURE  1  NETWORK  MODEL 


I 


packetization  delay  is  significant  and  must  be  considered  when  selecting 
the  packetization  strategy. 

Such  a  situation  arises  in  practice  when  many  interactive  users 
enter  a  packet  switched  network  over  low  speed  input  lines  and  then 
share  a  high  speed  line  within  the  network,  as  is  shown  in  Figure  1. 

There  are  M  users,  each  of  which  produces  an  average  of  p 
characters  per  slot.  The  slot  is  defined  to  be  the  length  of  time  it 
takes  the  low  speed  input  line  to  transmit  a  character .  Thus  p  is 
always  between  0  and  1.  We  assume  that  the  users  are  independent  of 
each  other  and  also  that  characters  from  a  given  user  arrive 
independently.  In  each  slot,  each  user  produces  a  character  with 
probability  p.  The  high-speed  network  link  shared  by  those  users  has 
a  speed  of  r  characters/slot. 

In  some  applications,  the  initial  network  node  can  echo  characters 
typed  by  the  user,  and  the  problem  of  trading  network  efficiency 
against  delay  perceived  by  the  user  can  be  avoided.  Thus,  the  node 
will  echo  individual  characters  back  to  the  terminal,  thereby  satisfying 
the  user's  constraint  on  delay,  and  also  buffer  a  reasonable  number  of 
characters  for  inclusion  in  each  packet,  thereby  keeping  the  overhead 
due  to  the  packet  header  acceptably  low. 

In  many  applications,  however,  this  approach  is  not  feasible.  For 
reliability,  some  users  prefer  an  echo  from  the  remote  host  to  ensure 
that  the  data  was  not  altered  in  transit  through  the  network.  In  some 
cases,  the  host  echos  not  just  what  is  typed  but  also  fills  in  default 
values  for  the  user.  An  example  of  this  is  a  system  which  will  fill  in 
the  remainder  of  a  keyword  or  file  name  once  the  user  has  entered  an 


unambiguous  prefix  (e.g.,  the  user  types  "COP"  and  the  system  echos 
"COPY  FROM"  because  the  only  command  which  starts  with  COP  is  the 
COPY  command).  Another  reason  for  a  remote  echo  is  that  the  network 
node  may  be  unable  to  echo  characters  on  a  selective  basis  and  unable 
to  handle  the  load  of  echoing  all  characters.  Thus,  the  problem  of 
handling  sources  which  produce  characters  at  a  low  rate  and  which 
require  packetization  within  an  amount  of  time  comparable  to  the  inter¬ 
arrival  time  of  individual  characters,  is  significant. 

We  thus  consider  a  problem  where  we  are  given  a  constraint  on 
delay,  D  (in  units  of  slots),  which  includes  both  packetization  and 
network  delay.  For  simplicity,  we  consider  the  case  where  the  network 
consists  of  a  single  link  as  shown  in  Figure  1.  Other,  more  complex 
situations  can  be  modeled  by  adjusting  D  to  reflect  the  allowable  delay 
after  subtracting  off  other  delays,  e.g.  host  delay,  and  by  adjusting 
the  expression  for  network  delay  to  reflect  the  presence  of  many  network 
elements.  This  is  beyond  the  scope  of  this  paper  and  will  not  be 
considered  any  further. 

Our  objective  is  to  maximize  M,  the  number  of  users  active  at  any 
time,  subject  to  a  constraint  on  D,  the  overall  delay,  by  obtaining  the 
optimal  packetization  strategy  which  keeps  both  the  packetization  and 
network  delay  small.  Specifically,  we  seek  to  maximize  M  subject  to  a 
constraint  on  D,  the  average  per  character  delay,  where  D  is  given  by: 


D  -  D  1  +  D  +  D 
pk  s  w 


where  Dpj^.  Dg,  and  are  the  average  per  character  delays  due  to 
packetization,  service  (transmission)  and  waiting  (for  the  shared 
network  link),  respectively. 


II.  Delay  and  Packetization  Models 

We  model  a  packetization  scheme  by  a  state  transition  diagram  as 
shown  in  Figure  2.  We  consider  only  packetization  schemes  which  form 
packets  entirely  from  characters  from  a  single  user  and  furthermore 
restrict  consideration  to  memoryless  strategies,  i.e.,  strategies  which 
consider  only  what  has  happened  since  the  last  packet  was  formed  in 
making  the  decision  when  to  form  the  next  packet. 

Figure  2  can  be  thought  of  as  the  state  of  the  buffer  corres¬ 
ponding  to  a  single  user.  The  initial  state  (at  the  top  of  the  diagram) 
corresponds  to  the  buffer  being  empty  immediately  after  a  packet  is 
formed.  Transitions  in  the  diagram  correspond  to  state  changes  over 
a  single  slot  of  width  A.  With  probability  p,  a  character  arrives  and  a 
transition  to  the  left  to  a  shaded  node  occurs.  With  probability  1-p,  no 
character  arrives  and  a  transition  to  the  right  occurs.  If  a  packet  is 
formed,  as  at  node  j  in  Figure  2,  the  transition  is  upward  to  a  state 
immediately  following  the  formation  of  a  packet. 

Each  state,  i,  has  associated  with  it  n.,  the  number  of  characters 
in  the  buffer;  d.  the  aggregate  packetization  delay  suffered  by  all  these 
characters;  and  r.,  the  probability  of  reaching  state  i.  The  initial 
state,  0,  has  nQ=0,  ^^=0,  and  rQ=l.  Figure  3  shows  the  general 
situation  for  a  downward  transition  (i.e.,  from  a  state  in  which  a 
packet  is  not  formed.  Note  that  the  relations  between  the  quantities  at 
nodes  j,  k  and  1  are  not  functiuns  of  whether  or  not  a  character 
arrived  at  node  j. 

The  packetization  rule  can  be  thought  of  as  choosing  which  nodes 
in  this  tree  to  packetize  at.  Indeed,  any  memoryless  rule  which  forms 
single  user  packets  can  be  so  represented. 
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FIGURE  2 


STATE  TRANSITION  DIAGRAM 
CORRESPONDING  TO  A  PACKETIZATION  SCHEME 


FIGURE  3 

INCREMENTAL  EFFECT  OF  NOT  PACKETIZING 
AT  NODE  j 


nk=n.+l 


dk=dj.n. 


j,n.,d.,r. 


nj=n. 


dfdj+n. 


r,=(l-p)T. 


A.  Packetization  Delay 

If  the  packetization  scheme  results  in  a  packet  length  distribution 


f.  =  Prob  {Packet  length  is  i  characters} 

/ 


then 


the  average  per  character  packetization  delay,  is  given  by 


1 
i  y 


f.  d. 
1  1 


where  d.  is  the  total  average  packetization  delay  suffered  by  all 
characters  in  a  packet  of  length  i  and  y  is  the  average  packet  length 
which  is  given  by 


y  =  i  i  fj 

i 


Since  characters  arrive  independently  at  a  rate  of  p  characters  per 
slot,  the  average  interarrival  time  between  characters  is  ^  slots  and  d. 
obeys  the  recurrence  relation 


d. 

1 


+ 


i-1 

P 


This,  together  with  the  initial  condition  dQ=0  yields 


d. 

1 


2p 


Thus,  for  example,  if  the  scheme  resulted  in  a  2-point  distribution  with 


y 


n+q 


5  =  (l-q)n(n-l)  q(iT«-l)n 

pk  (n+q)2p 

=  Yil  ^  q(i-q) 

2p  2py 

This  function  is  drawn  in  Figure  4.  As  can  be  seen,  the  function  is 
continuous  and  piecewise-differentiable.  It  behaves  roughly  like  the 
linear  function  and  indeed  matches  that  function  for  q=0  and  q=l. 
The  difference  between  the  two  functions  is  greatest  near  q=n[(l+^)^-l] 
This  difference  decreases  as  y  increases. 

B .  Service  Time 

The  average  service  time  per  character,  D^,  is  given  by 
1 

^s  =  f  ^  ^  H 

1  y 

where  r  is  the  speed  of  the  high-speed  lines  in  characters/slot  and  H  is 
the  number  of  characters  in  the  packet  header.  For  the  2-point 
distribution  above. 


Dg  =  -  [  y  +  H  + 

®  r  y 


Thus  we  see  that  D  behaves  in  a 

s 

continuous,  piecewise-differentiable 
very  closely,  the  difference  getting 


manner  very  similar  to  D  ,  .  It  is 

pk 

and  follows  a  linear  function  of  y 
smaller  as  y  increases. 
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C.  Waiting  Time 

The  exact  relationship  between  the  character  arrival  rate,  packet¬ 
ization  strategy  and  D^,  the  average  per  character  waiting  time  for 
the  high  speed  channel,  is  complex.  We  believe  a  reasonable  approxi¬ 
mation  to  this  delay  is  given  by  approximating  D  by  the  waiting  time 

w 

in  an  M/G/1  queuing  system.  Since  the  character  arrival  process  is 
Poisson,  this  seems  reasonable.  Thus,  is  approximated  by 

n  = 
w  2(l-p) 

where  M  is  the  number  of  users 

X  is  the  average  arrival  rate  per  user  in  packets  per  slot 
is  the  second  moment  of  the  service  time  for  a  packet 
and  p  is  the  average  channel  utilization. 

For  the  system  at  hand, 

X  =  ? 

y 

p  =  pM 

r  y 

f.(i+H)2 


For  the  2-point  distribution  above, 

^  =  -p  I(y+H)2  +  q(l-q)] 

and 


4 


“4 


-0 


*4 


>0 


i 
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2r2  [1-  EM  j 

r  y 

D .  Total  Delay 

From  the  above,  we  have  a  general  expression  for  D,  the  average 
per  character  delay  in  slots. 


MA(i-HH)^ 


5  =  2  f. 

i  *  2py  r  Zr^fl-p) 


(*) 


III.  Derivation  of  the  Optimum  Packetization  Strategy 

We  now  consider  the  problem  of  maximizing  M,  the  number  of 

users,  subject  to  a  constraint,  D*,  on  the  average  per  character  total 

delay.  We  thus  seek  the  optimal  value  of  the  packet  length,  y,  and  the 

optimal  packetization  scheme  which  will  yield  this  optimal  length. 

Given  M  and  y,  we  see  from  equation  (*)  that  the  packetization 

scheme  that  minimizes  D  is  one  which  minimizes  the  second  moment  of 

the  packet  length,  li^f..  We  are  constrained,  of  course,  to  packet 

i 

length  distributions  where  the  packet  length  only  takes  integer  values. 
It  is  easy  to  see  that  the  distribution  which  minimizes  the  second 
moment  subject  to  these  constraints  is  the  2-point  distribution  which  we 
have  been  discussing;  i.e., 

1-q  i  =  n 

fj  =  q  i  =  n+1 

0  otherwise 

where  y  =  n+q.  To  see  this,  we  use  the  technique  of  Lagrange 
multipliers  (9)  to  find  the  minimum  of 

li^t.  +  a  Zif.  +  6Zf.  where 
1  .1.1 
1  1  1 

a  and  p  are  the  Lagrange  multipliers.  The  last  two  terms  are  used  to 
satisfy  the  constraints  Zif.  =  y  and  Zf.  =  1.  In  addition,  all  f.  ^  0. 
Differentiating  with  respect  to  the  f.  we  get 

=  0  for  all  i  such  that  f.  >  0 

i^  +  ai  +  P  * 

g  0  for  all  i  such  that  f.  =  0 

1 


Thus  there  are  at  most  two  non-zero  values  of  f.  that  can  satisfy  the 


equality  condition  above.  To  satisfy  the  inequality  condition  these  two 
values  of  i  must  differ  by  one. 

Thus  we  see  that  for  a  given  value  of  y  and  M,  the  total  delay  is 
minimized  by  any  packetization  scheme  which  gives  rise  to  a  2-poinl 
packet  length  distribution. 

We  see  from  this  that  we  need  only  consider  2-point  packet  length 
distributions  in  order  to  obtain  an  optimal  packetization  strategy  since 
minimizing  delay  for  a  given  number  of  users  and  maximizing  the 
number  of  users  for  a  given  delay  are  complementary  problems.  The 
question  of  how  to  find  the  average  packet  length,  y,  still  remains  and 
will  be  discussed  below. 

A  similar  result,  proving  that  a  2-point  packet  length  distribution 
is  optimal,  can  also  be  obtained  by  approaching  the  problem  in  a 
different  way  by  considering  packetization  delay  and  service  time 
together.  Any  packetization  scheme  can  be  thought  of  as  a  decision 
rule  which,  given  the  state  of  the  buffers,  decides  whether  to  packetize 
or  not.  Figure  2  shows  the  state  space  for  this  decision  process. 

Tracing  a  path  from  the  root  to  any  node,  j,  one  can  determine 
the  probability,  r.,  of  reaching  that  state  by  multiplying  all  of  the  link 
probabilities.  One  can  also  find  the  number  of  characters,  n.,  in  the 
buffer  by  counting  the  number  of  shaded  nodes  in  the  path  from  the 
root  to  node  j.  Finally,  one  can  determine  d.,  the  total  delay  due  to 
packetization  and  service  time  suffered  by  all  characters  in  the  buffer, 
if  a  packet  is  formed  at  node  j.  This  is  done  by  adding  together  the 
delays  of  all  the  characters  in  the  packet. 

Let  Xj  be  a  decision  variable  taking  the  value  1  if  we  packetize  at 
node  j  and  zero  if  we  do  not.  Any  packetization  scheme  can  be  defined 
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solely  in  terms  of  the  x^.  Furthermore, 


y  =  Z  X.  r.  n. 
j  1  I  I 


D.  =  X  X,  r,  d, 


where  Dj  is  the  average  packetization  and  service  delay  for  all 
characters.  We  seek  to  maximize  y  subject  to  the  constraint  Dj/y  S  D*. 
There  is  also  an  additional  constraint  that  the  packetization  scheme  be 
consistent.  In  particular,  x^  =  1  implies  that  Xj^  =  0  for  all  successors 
k  of  node  j  since  once  we  packetize  at  node  j  we  will  never  reach  node 


It  is  somewhat  easier  to  see  what  the  optimal  policy  is  if  we 
consider  the  decision  process  in  a  slightly  different  way.  We  define  a 
new  set  of  decision  variables,  w^.,  associated  with  the  nodes  j.  The  w^. 
are  related  to  the  x.  above  by  the  rule  that  if  x^  =  1  then  w.  =  1  for  all 
predecessors  of  j  and  w.  =  0  otherwise.  For  all  consistent  values  of  x. , 
the  w.  are  well  defined.  Setting  w.  =  1  can  be  thought  of  as  deciding 
not  to  packetize  at  node  i. 

We  now  define  g.  as  the  increment  in  packet  length  due  to  the 
decision  not  to  packetize  at  node  j.  Similarly  we  define  h.  as  the 
increment  in  the  average  total  delay,  Dj. 

The  expressions  for  y  and  Dj  can  then  be  rewritten  in  terms  of 
the  new  decision  variables,  w^,  as  follows: 


y  =  I  w.  g. 

J  1  ^1 


D,  =  I  w,  h. 

J 


The  incremental  effect  of  not  packetizing  at  node  j  is  shown  in 


Figure  3.  As  can  be  seen  if  =  1,  two  nodes,  k  and  1,  will  be 
counted  instead  of  node  i  in  the  sums  for  y  and  Dj.  Thus: 


g.  =  (l-q)r. 


h. 

J 


2n.  +  H  +  1 


Cn.  + 


In  order  to  maximize  y  for  a  given  D  we  would  prefer  to  choose 
nodes  j  with  maximal  g^/hj,  i.e.  nodes  which  maximize  the  incremental 
gain  in  y  per  unit  Dj.  Note  that  in  reality  it  is  Dj/y  not  Dj  which  is 
the  constraint  but  since  we  seek  to  maximize  y  we  are  simultaneously 
loosening  the  constraint  as  much  as  possible. 

In  order  to  maximize  Qj/hj  we  choose  nodes  j  with  the  smallest 
possible  n.  and  set  w,  =  1  for  those  nodes.  We  continue  to  pick  nodes 
until  we  find  that  setting  the  next  w.  =  1  would  cause  Dj/y  to  exceed 
D*.  Until  this  point,  we  have  assumed  for  the  sake  of  simplicity  that 
w^  would  only  take  the  values  0  or  1.  In  fact  we  are  free  to  assign 
any  value  in  the  range  0  to  1  to  each  w. .  At  the  point  where  setting 
Wj  =  1  would  violate  the  constraint  on  Dj,  there  exists  a  value,  a,  of 
w^  which  would  cause  the  constraint  to  be  satisfied  exactly,  since  D*y 
is  a  monotone  function  of  w^.  Thus,  we  set  w^  =  a  for  this  last  j  and 
satisfy  the  constraint  with  equality. 

It  is  clear  that  this  procedure  will  indeed  find  an  optimal  set  of 

values  for  the  w.  in  the  sense  that  y  is  maximized.  To  see  this  we 
J 

need  only  note  that  in  order  to  increase  the  value  of  any  w.  above  the 
value  set  by  this  procedure  we  would  have  to  correspondingly  decrease 


the  value  of  some  other  Wj^  with  a  resulting  decrease  (or  at  best  no 
change)  in  y.  This  is  because  we  are  selecting  the  nodes,  j,  in  order 
of  g^/hy  largest  first. 

Note  that  at  any  stage  there  is  an  infinite  number  of  nodes  j  with 
the  same  value  of  n.  to  choose  from.  Any  selection  will  yield  exactly 
the  same  value  of  y.  Indeed,  one  need  only  ask  if  w^  can  be  set  to  1 
for  all  nodes  with  a  particular  n^.  If  so,  we  do  this  before  asking  the 
question  for  the  next  larger  value  of  n^.  Thus  we  avoid  any  conceptual 
difficulty  with  the  finiteness  of  the  decision  process. 

We  are  left  then  only  with  the  question  the  consistency  of  the 

packetization  policy.  This  too,  however,  is  no  problem.  Note  that  if 

we  set  w.  =  1  if  n.  <  n*  and  also  for  a  collection  of  nodes  with  n.  =  n* 
1  J  1 

and  with  aggregate  probability  1-q  then  we  have  a  consistent 
packetization  policy  which  adheres  to  the  optimization  criteria.  This 
policy  can  be  simply  stated,  and  equivalently  implemented,  as  waiting 
until  the  n*—  character  arrives,  flipping  a  coin  which  comes  up  heads 
with  probability  q,  packetizing  at  that  point  if  the  coin  comes  up  tails, 
and  packetizing  after  the  next  character  arrives  otherwise. 

We  have  thus  shown,  again,  that  a  2-point  distribution  minimizes 
delay  due  to  packetization  and  service  time.  Given  M,  the  number  of 
users,  this  also  minimizes  queuing  delay,  as  shown  above.  We  now 
turn  to  the  final  question,  how  to  determine  y,  the  optimal  average 
packet  length,  in  practice. 
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Computation  of  the  Optimal  Packet  Length 


Examination  of  the  expressions  for  the  components  of  delay  given 
in  Section  II  shows  that  grossly,  packetization  delay  and  service  time 
grow  linearly  with  packet  length  while  queuing  packetization  delay 
decreases  as  the  packet  length  increases  (since  the  overhead  due  to  the 

header,  which  contributes  to  the  utilization,  decreases).  There  is, 

however,  a  term  proportional  to  q(l-q)  (where  q  is  the  fractional  part 
of  the  packet  length)  present  in  each  delay  component.  This  term 

causes  the  derivative  of  delay  with  respect  to  packet  length  to  be 
discontinuous  and  prevents  us  from  simply  doing  a  gradient  search  to 
find  the  optimal  value  of  y.  As  can  be  seen  by  examining  Figure  2, 
however,  one  might  expect  a  simple  search  to  yield  a  reasonable  approx¬ 
imation  to  y  since  the  q(l-q)  term  is  small  and  becomes  even  less 

significant  as  y  increases.  We  are  thus  led  to  the  following  approach. 

Ignoring  the  q(l-q)  terms  in  D,  we  can  find  y,  an  approximate 
value  of  y,  using  a  gradient  search  or  other  simple  search  procedure. 
Thus  we  let 


and 


Mp 


(y+H)' 


D  = 


2p 


^  -  (y+H)  + - 


) 


dy 


2p 


Mp(i^;^)i(i-^)(^>^ 

~  2r^  ll-f 


Given  M,  we  can  then  minimize  D  by  finding  y,  the  value  of  y,  where 
the  derivative  is  0.  Given  D,  the  value  of  M  can  then  be  adjusted 
since  D  is  monotone  with  M;  i.e.,  if  D  is  less  (greater)  than  the  required 


delay,  then  M  can  be  increased  (decreased).  A  binary  search  on  M  can 
thus  be  performed.  A  simple  initial  upper  limit  on  M  for  this  purpose 
is  M=r/p.  Given  these  approximate  values  of  y  and  M,  it  is  then  possible 
to  proceed  in  practice  using  the  real  values  of  D  including  the  q(l-q) 
terms.  The  derivative  is  now  only  continuous  over  individual  integral 
values  of  y  and  so,  strictly  speaking,  a  separate  search  must  be  per¬ 
formed  over  each  such  interval.  In  practice,  at  most  2  such  intervals 
would  have  to  be  searched  one;  the  above  approximate  values  were 
determined  since  the  q(l-q)  terms  do  not  alter  the  result  much.  This 
is  verified  empirically  by  the  computational  experience  presented  below. 


V.  Computational  Experience 


We  now  examine  the  effect  in  practice  of  using  this  optimal  packet¬ 
ization  scheme  rather  tha  •  simply  forming  fixed  length  packets.  This 
issue,  along  with  a  compt  ison  \/ith  other  strategies,  is  examined  in 
more  detail  in  [1].  A  simple  optimization  procedure  was  implemented  for 
this  purpose,  and  is  described  below. 

The  search  procedure  used  is  shown  in  Figure  5.  The  variables  y 
and  M  are  the  current  values  of  the  packet  length  and  number  of  users 
being  examined,  respectively.  The  variables  y*  and  M*  are  the  current 
estimates  of  the  optimal  values  of  y  and  M.  is  the  delay  require¬ 

ment  in  slots.  The  variables  A  and  T  control  the  length  and  precision 
of  the  search  and  were  set  to  .01  and  2,  respectively.  We  are  interested 
primarily  in  cases  where  y  is  small  and  so  this  simple  linear  search  is 
quite  reasonable.  For  large  y  the  procedure  described  in  the  preceding 
section  could  be  implemented  or,  more  simply,  larger  values  of  A  and  T 
could  be  used  with  this  procedure  since  small  differences  in  y  would 
not  alter  M. 

We  examined  problems  with  p,  the  number  of  characters  per  user 
per  slot,  of  and  r,  the  trasmission  speed  of  the  high-speed  channel 
in  characters  per  slot,  of  300.  This  corresponds  to  users  producing 
characters  at  an  average  rate  of  1  per  second  on  a  low-speed  channel 
with  peak  rate  4  characters  per  second  and  sharing  a  high-speed 
channel  of  1200  characters  per  second.  Values  of  H,  the  packet  header 
size,  of  1,  6,  and  32  were  examined. 
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FIGURE  5  SIMPLIFIED  SEARCH  PROCEDURE 
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Table  1  shows  the  results  of  a  comparison  between  the  optimal 


scheme  and  fixed  length  packets.  The  column  labeled  "%-difference" 
shows  the  percentage  improvement  in  M  using  optimal  packet  lengths 


(y*)  rather  than  the  best  fixed  length  Figure  6  shows  this 


comparison  for  H=6.  As  can  be  seen,,  the  improvement  is  quite 
significant  in  some  cases.  In  particular,  this  difference  is  most 
significant  for  small  delay  and  large  H. 

Note  that  the  difference  in  performance  is  not  monotone.  In 
particular,  as  the  allowable  delay  increases,  y*  increases.  As  y*  takes 
a  larger  fractional  part,  M*  increases.  The  fixed  length  scheme, 
however,  cannot  do  this  and  so  increases  only  very  slightly. 

The  rise  in  M*  with  increasing  y*  is  due  to  the  decrease  in  header 
overhead  as  the  packet  length  increases.  The  small  increase  in 
is  due  to  the  slight  increase  in  utilization  permitted  by  increasing  the 
allowable  delay. 


The  column  labeled  in  Table  1  and  shown  in  Figure  1  is  an 


upper  bound  on  M  computed  from  the  simple  observations  that  the 
packetization  delay  must  be  smaller  than  the  total  allowable  delay,  D, 


and  the  packetization  delay  is  at  least  — .  This  puts  an  upper  bound 

zp 


on  y: 


y  g  2p  D  +  1  (2) 

An  upper  bound  on  M  is  then  obtained  by  observing  that  the 
utilization  of  the  high  sneed  channel  is  bounded  by  1: 

M  c  E  ..Y~  /"5  ^ 


Thus 


M  S 


r  2pD->-l 
p  2pD+l+H 


As  we  can  see,  this  bound  is  lightest  for  large  D  and  small  H.  It  is 
sometimes  useful  in  quickly  estiating  M*  to  compare  it  with  which 

is  easily  estimated  by  equations  2  and  3  and  restricting  y  to  integers. 


« 
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VI .  Conclusions 

We  have  presented  a  packetization  strategy  along  with  a  proof  of 
its  optimality.  The  technique  is  easily  implemented  in  practice.  A 
comparison  with  fixed  length  packets  is  presented  and  the  optimal 
technique  exhibits  significantly  higher  throughputs,  in  some  cases  over 
50%  higher.  We  conclude  therefore  that  the  packetization  strategy 
should  be  used,  in  practice,  especially  when  the  delay  constraint  is 
tight  and  the  header  size  is  large. 
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<4falrarf— Rcccnth.  emphasis  has  been  placed  on  inlecralcd  conimuni- 
calion  ladlilics  capable  of  handlin|>  bolh  line-snilching  and  pachel-sHiich- 
big  digital  traffic.  The  problem  of  d>  namicallt  allocaiing  ihe  bandnidlh  of  a 
tttink  to  both  tspes  of  traffic  is  formulated  as  a  Markotian  decision 
process.  Line  switching  is  modeled  as  a  lime  division  mulliplesing  loss 
scheme  over  a  varsing  portion  of  a  fixed  lime  frame.  Packri-swiiching 
traffic  is  served  through  the  remaining  portion  of  the  frame  and  requires 
queueing  al  the  mulliplever-concentralor.  Two  different  cost  criteria  are 
examined  involving  probabilitx  of  blocking  for  line  switching  and  average 
queueing  delav  for  packets.  The  corresponding  optimization  problems  are 
presented  under  reasonable  simplifving  assumplionv  The  movable  boundarx 
scheme  suggested  for  commercial  implementation  of  integrated  multi¬ 
plexers  is  shown  to  offer  optimal  or  near-optimal  performance. 

I.  Introduction 

IN  ORDER  to  multiplex  and  concentrate  different 
data-sending  facilitie!>  over  a  large  bandwidth  channel, 
a  choice  of  a  switching  technique  must  be  made.  Line 
switching  is  used  for  transmission  of  long  messages  and 
requires  a  permanent  connection  between  the  communicat¬ 
ing  ends  for  the  duration  of  the  session.  This  connection 
can  be  a  physical  line,  a  synchronous  slot  allocation  in  a 
time  division  multiplexing  scheme,  or  a  frequency  bund 
allocation  in  frequency  division  multiplexing.  Initial  setup 
times  must  be  short  compared  to  the  message  length  for 
efficient  channel  utilization.  Most  of  the  actual 
line-switched  systems  are  loss  systems,  i.e..  a  request  for  a 
new  line  is  blocked  when  no  bandwidth  is  available,  and  a 
new  trial  must  be  made  later.  The  probability  of  blocking  a 
new  request  is  a  measure  of  the  performance  of  the  system. 

For  short  interactive  types  of  traffic,  messages  arriving 
at  the  concentrator  wait  in  a  queue  (either  as  whole  mes¬ 
sages  or  divided  into  smaller  portions  called  packets)  and 
are  served  according  to  some  discipline.  A  header  m  the 
message  or  packet  is  required  containing  identification  and 
destination  information.  Routing  and  reassembly  of  mes¬ 
sages  at  the  destination  node  are  additional  problems  in 
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such  systems.  A  measure  of  the  performance  of  the  system 
is  the  average  queueing  delay  experienced  by  an  average 
message. 

Gloss  in  [1|  presented  a  comparative  evaluation  of  line- 
and  packet-switching  lossless  schemes  and  showed  the  su¬ 
periority  of  the  former  for  lengthy  message  traffic  and  of 
the  latter  for  bursty  short  packet  traffic.  This  basic  result 
has  been  further  demonstrated  in  [2]-[4]  for  various  classes 
of  models. 

In  a  more  general-purpose  digital  data  communication 
network,  it  is  desirable  to  integrate  both  line-  and  packet¬ 
switching  facilities.  The  user  or  the  network  manager  may 
decide  on  the  most  suitable  discipline.  This  integrated 
facility  can  accommodate  simultaneously  line-switching 
traffic  such  as  voice  traffic  or  computer  batch  traffic,  and 
packet-switching  data  traffic,  such  as  bursty  tenninal-to- 
host  traffic. 

Various  schemes  have  been  studied  and  implemented. 
Fi.xed  frame  fixed  boundary  schemes  allocate  a  pre¬ 
determined  portion  of  a  time  frame  (consisting  of  a  given 
number  of  constant  duration  slots)  to  each  type  of  inffic. 
Fixed  frame  movable  boundary  schemes  improve  the  packet 
traffic  throughput  by  introducing  some  intelligence  to  the 
previous  scheme.  Packet  traffic  is  allowed  to  occupy  any 
idle  slot  of  the  line  switching  portion  of  the  frame. 
The  movable  boundary  scheme  was  first  analyzed  by 
Kummerle  |S]  and  an  implementation  research  project  has 
been  undertaken  by  IBM  Zurich  [6)-[8].  The  proposed 
architecture  is  summarized  in  [9). 

parallel  effort  was  motivated  by  the  U.S.  Department 
of  Defense  aimed  at  replacing  the  AUTOVON 
line-switched  voice  network  and  .^UTODIN  packet- 
switched  data  network  by  an  integrated  structure.  Various 
alternatixes  have  been  considered  by  the  Defense  Com¬ 
munications  Agency  (DCA)  and  its  contractors.  These 
include  either  movable  boundary  protixols  1 101-(  12]  or  the 
use  of  packetized  voice  techniques  to  eliminate  line  switch¬ 
ing  |13j-(l5|.  Commercial  vendors  (such  as  TR.^N  Cor¬ 
poration  (16]  and  Cixiex  Corporation  (17])  have  announced 
integrated  approaches.  The  Codex  approach  uses  variable 
frame  multiplexing  with  the  frame  size  adjusted  to  the 
traffic  variations.  .•Vnalyscs  of  variable  frame  integrated 
multiplexers  can  be  found  in  jlSj  and  |1'^]. 

The  growing  interest  in  integrated  line-  and  packci- 
swiiched  networks,  as  demonstrated  in  the  foregoing  survey. 
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and  specifically  ihc  cfforis  lo  improve  ihe  efficiency  of 
these  system.v  in  terms  of  the  trunk  capacity  utilization,  led 
to  the  study  presented  in  this  paper.  Its  ainr  is  to  investi¬ 
gate  the  limit  of  the  capaeity  utilization  improvement  un¬ 
der  a  basic  constraint:  fixed  frame  size  and  synchronous 
service  provision  to  line-switching  sessions  once  they  get 
access  to  the  system.  A  loss  minlel  is  a.s.sumed  for  the 
line-switching  traffic  and  a  first-in-first-out  queueing 
model  for  the  fixed  size  packet  traffic.  The  problem  of 
optimal  allocation  of  frame  slots  is  formulated  as  a 
Markovian  decision  process.  Two  optimality  criteria  are 
examined.  Under  the  first,  the  average  time  delay  for  the 
packet  traffic  is  minimized  with  the  probability  of  blocking 
a  new  line-switching  arrival  constrained  to  be  below  a 
given  level.  A  linear  programming  method  is  used  to  de¬ 
termine  the  optimal  policy  under  a  birth-death  process 
model  for  the  line-switching  traffic.  Under  the  second 
criterion,  a  linear  combination  of  the  average  time  delay 
and  the  probability  of  blocking  is  minimized.  A  policy 
improvement  algorithm  is  used  to  obtain  the  optimal  poli¬ 
cies  both  under  the  same  birth -death  assumption  and  for 
the  general  case  of  multiple  arrivals  or  completions  in  a 
frame.  It  is  shown  that  under  the  birth-death  assumption, 
the  movable  boundary  scheme  is  in  some  cases  optimum  or 
close  to  optimum  in  the  examples  studied.  Similar  results 
are  obtained  for  the  general  case. 

U.  The  Model 

A.  Input  Traffic 

Line-switching  traffic  is  assumed  to  consist  of  messages 
arriving  with  a  Poisson  rate  of  A,  messages/s.  Message 
length  is  exponentially  distributed  with  mean  I  s.  A  new 
arrival  may  either  be  accepted  or  blocked.  Packet-switching 
traffic  consists  of  fixed  length  packets  arriving  at  a  Poisson 
rate  of  A 2  packeis/s  and  stored  in  a  finite  queue  of 
capacity  Q  packets.  Q  is  assumed  lo  be  large  enough  to 
result  in  negligible  packet  overflow. 

B.  Output  Trunk 

The  incoming  traffic  of  both  types  is  multiplexed  onto  a 
main  trunk  of  capacity  C  packets/s.  A  frame  of  h  s 
duration,  divided  into  M  slots  as  in  Fig.  I,  accepts  both  the 
line-switching  and  packet-switching  traffic.  The  slot  size  is 
chosen  to  accomminlate  exactly  one  packet.  Hence  C  = 
M/b.  Si  slots  in  the ylh  frame  (Fig.  I)  are  allocated  to  the 
line-switching  messages,  one  slot  per  message,  in  a  time 
division  multiplexing  mode  (every  frame  pnxresses  a  frac¬ 
tion  of  a  message  equal  to  the  packet  or  slot  size).  The 
multiplexer  must  guarantee  no  interruption  of  synchronous 
service  to  the  message  already  in  the  system.  A  new  setup 
call  is  stored  and  considered  for  acceptance  to  the  trunk  at 
the  beginning  of  a  new  frame.  The  remaining  A/  -  S,  =  N. 
slots  are  allocated  to  the  packets  present  in  the  packet 
queue  at  the  beginning  of  the yth  frame,  in  a  firsi-in-  first- 
out  mode.  The  object  is  to  determine  the  optimum  alloca- 
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tion  of  Si  and  A',  to  circuit  and  packet  traffic,  respectively. 
As  noted  earlier,  two  optimality  criteria  are  investigated  for 
this  purpose. 

C.  Decision  Mechanism — State  of  the  System 

We  define  the  state  of  the  system  X,  at  the  time  instant 
just  prior  to  the  opening  of  the  y  th  frame  as  follows: 


where 

Lj  number  of  packets  waiting  in  the  packet  queue, 
number  of  messages  already  in  the  system  and  re¬ 
quiring  continuation  of  service,  and 
r,  number  of  new  setup  calls  during  the y  -  1  frame. 

The  multiplexer  decides  on  5,.  based  on  X^.  the  previous 
history  of  the  system,  and  some  optimizing  rule.  Continua¬ 
tion  of  service  for  the  line-switching  traffic  must  be 
guaranteed  and  any  portion  of  the  frame  not  used  by  long 
messages  must  be  allocated  to  packet-switching  traffic. 
Hence  <  5,  ^  min(r,  -t-  i',.  M). 

The  transition  probabilities  from  a  present  state  Af,  to  a 
new  state  ,  depend  only  on  X,  and  the  number  of  slots 
allocated  to  the  line-switching  traffic  of  the  yth  frame  S’,.  In 
order  to  derive  them,  we  need  the  probability  distributions 
of  the  components  of  state  A', . ,.  These  are  given  by  ( 1  )-(3) 
below  and  arc  obtained  by  noting  the  following. 

1'^..,  Number  of  new  setup  calls  during  the  yth  frame. 
The  number  follows  a  Poisson  distribution  with 
mean  A,/>.  (Equation  (1)  follows  directly.) 
r.,  Number  of  messages  requiring  continuation  of 
service  among  the  S^  messages  currently  in  the 
system.  Since  we  assumed  that  the  meNs.ige  lengths 
were  exponential,  r, . ,  depends  only  on  S^.  (This  is 
due  to  the  mcmoryless  property  of  the  exponential 
distribution.)  The  binomial  distribution  of  (2)  fol¬ 
lows  directly.  (The  prnhahilitv  of  a  completion  in 
a  frame  h  s  long  is  I  -  c  ‘‘''.) 

L, .  ,  Depends  on  and  S,  according  to  the  transition 
equations  of  a  finite  queue  with  Pinsson  arrivals 
A  ,/>.  constant  service  b.  and  A',  =  M  -  S,  servers. 
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This  leads  to  (3). 


(A2M' 


i{L^^Nj  =  M  -  S^, 

{L^^,-L^  +  Ny 

if  AT  <L^  <!.,,,  + A'. 

0,  otherwise 

for  all  Lj^,  <  Q.  ^  Q.  and  P^(Q  |  Z.^)  =  I  - 
2?=o^s,(^y+ 1  =  ^  1  ^y)-  (Recall  that  ,  '<  Q.) 

Since  each  of  the  v^^^,  L,+  ,  random  variables  de¬ 

pends  only  on  the  previous  combination,  we  conclude  that 
the  state  transition  probabilities  are  given  by 

/’s,(3fy..|^y)=/*(jfy*,l^y,Sy) 


Depending  on  the  length  of  the  frame  b  and  the  arrival  rate 
X,.  we  may  define,  within  any  acceptable  level  of  accuracy, 
the  maximum  number  of  new  calls  per  frame  CMAX  such 
that  Pipj  >  CMAX)  s  0.  Thus  f,.  r^,  may  assume  a  finite 
number  of  values  each,  and  the  process  has  a  finite  state 
space. 

The  transitions  (4)  define  a  finite  state,  discrete  time 
Markov  process  with  a  decision  mechanism  controlling  the 
evolution  of  the  process.  Note  that  the  vector  state  rep¬ 
resentation  does  not  lead  to  the  erroneous  implicit  inde¬ 
pendence  of  its  components  as  in  (20].  reported  in  (21).  The 
theory  of  Markovian  decision  proces.ses  (see  (22))  provides 
us  with  the  analytical  and  computational  tov>ls  for  the 
evaluation  of  an  optimal  policy  v.  i.e..  an  a.ssienment  rule, 
possibly  probabilistic,  of  an  action  5,  (the  number  of  slots 
to  be  assigned  to  line-switched  traffic)  to  a  state  based 
on  the  history  of  the  process. 


III.  First  Optimality  Criterion 

The  first  optimality  criterion  we  consider  is  that  of 
minimizing  the  average  packet  queueing  delay  Td  with  the 
probability  PI  of  blocking  newly  arriving  line-switching 
calls  constrained  to  be  no  more  than  a  specified  acceptable 
level  FLOSS. 

By  Little's  formula  |2.1|  the  packet  delay  is  proportional 
to  the  average  packet  queue  size  E{  L ).  The  policy  i^f 
assignment  rule  m  that  minimizes  E[L)  is  identical  with 
one  that  minimizes  TJ,  and  thus  we  fiKus  on  E{  L )  only. 
Pt.  henceforth  called  loss  probability,  may  also  be  written 


in  terms  of  expccta;!  ms  of  the  various  parameters  previ¬ 
ously  determined  in  Section  II.  The  number  of  line-svMiched 
calls  turned  away  in  a  frame  is  r  r  -  S^.  the  excess  ol 
calls  over  the  slots  made  available.  /’/  is  the  ratio  of  the 
average  of  this  quantity  to  the  average  number  of  calls  in  a 
frame.  X,A.  Using  subscript  it  to  denote  policy  it.  PI  is 
given  by 


The  optimality  criterion  is  thus  formulated  as 


Here  FLOSS  is  the  maximum  permitted  fraction  of  lost 
calls.  The  transition  probabilities  (4)  and  the  criterion  (6) 
define  a  finite  state  decision  process  with  minimization  of 
the  average  expected  cost  EJ  L, )  and  a  constraint  on  the 
average  linear  action-state  combination  £,(r^ 
turns  out  (see  [24))  that  if  it  is  restricted  to  the  irreducibility 
class  .  as  defined  in  the  Appendix,  then  there  exists  a 
stationary  policy,  assigning  actions  to  states  independently 
of  the  past  history,  which  satisfies  the  optimality  criterion 
of  (6).  It  is  shown  in  [24]  and  [25]  that  the  inequality 
constraint  in  (6)  leads,  in  general,  to  a  possibly  probabilis¬ 
tic  (i.e..  nondeterminisiic)  assignment  rule  for  the  optimum 
policy. 

A  considerable  simplication  of  the  problem  arises  if  v.c 
assume  the  frame  duration  b  is  very  short  compared  to  the 
average  message  length  1  //i.  (This  corresponds  to  the  case 
of  a  high-.specd  trunk.)  This  assumption  is  identical  to  the 
one  used  in  the  Erlang-fi  approximation  for  synchronous 
line-switching  traffic,  and  it  is  shown  to  be  valid  for  a  wide 
variety  of  applications  in  [26).  Under  this  assumption,  the 
number  of  line-switching  messages  in  the  system  may  not 
change  by  more  than  one  from  frame  to  frame.  This 
corresponds  to  a  birth-death  pnxess  model  for  the  line- 
switching  traffic.  There  are  thus  only  two  possible  actions 
for  the  optimal  policy  — to  block  a  new  •  all  or  to  allocate  a 
slot  to  it.  The  three-dimensional  \ectoi  .Y^  is  then  reduced 
to  a  two-dimen.sional  vector  X^  =  (  E^.  r, ).  and  the  possible 
actions  may  be  written  as 

/4(  L, .  rj  =  0.  if  a  new  call  is  blocked. 

.4(  L, .  rj  =  I .  if  a  new  call  is  given  service. 

Obviously,  a  decision  must  be  made  only  if  a  new  arrival 
vxxurs  (with  probability  A, A).  The  boundary  .V  may.  there¬ 
fore.  either  accomnuxJate  the  r  sessions  already  in  the 
.system  or  increase  by  one  in  order  to  alUxate  a  newly 
arrived  session. 
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Note  Ihji  ihc  prohjhililv  of  kiw  /*/.  will  in  this  ca>c  ho 
£.(r,  +  |., -.9,)  0)/>|..  I) 

^ '  X,A  ~  '  \^h 

=  P,{A{L,.r,)=0).  (5a) 

Under  the  birth-desiih  approximation,  ihorefore.  /*/,  is  the 
actual  probability  of  loss  (or  probability  of  blocking)  for 
(he  line*swiiched  irafne.  This  justifies  the  use  of  the  term 
probability  of  loss  for  the  average  fraction  of  (5). 

The  problem  of  minimizing  the  average  packet  queue 
length  given  the  inequality  constraint  on  the  probability  of 
blocking  new  setup  calls  can  now  he  formulated  (22.  p. 
152]  as  (he  following  linear  program. 

For  every  class  of  policies  {»)„.  A,//i  <  M‘  ^  M  (see 
the  Appendix)  determine  the  nonnegative  variables 
P(L,  r.  A)  (the  joint  probabilities  of  state  (Z..  r)  and  ac¬ 
tion  6  (0. 1 )  defining  the  probabilistic  state  action  as¬ 
signment)  for  0  ^  Z.  ^  0  £  r  2  M‘.  in  order  to  mini- 


£(/.)=  2  2  2  LP(L.r.i) 

/.so 

under  the  probability  of  blocking  constraint 

2F(Z../-,^  =  0)SPLOSS.  (7) 

l..r 

(he  linear  transition  constraints,  and  the  probability  mea¬ 
sure  constraint  (all  probabilities  are  nonnegative  and  sum 
to  one). 

The  linear  programming  algorithm  may  be  used  to  com¬ 
pute  different  time  delay  probability  of  blocking  optimal 
pairs.  Apart  from  (he  computational  complexity,  the  main 
disfdvantage  of  (he  method  is  the  nondeterministic  struc¬ 
ture  of  (he  resulting  policies  due  to  the  probability  of 
blocking  constraint.  It  is  thus  desirable  to  introduce  another 
optimality  criterion  leading  to  deterministic  policie.s.  This 
is  done  in  the  next  section.  Computational  results  for  both 
criteria  are  presented  in  Section  V. 

IV.  Second  Opti.mal(ty  CaiTEittON 

The  second  optimality  criterion,  which  leads  to  de¬ 
terministic  policies,  .simply  minimizes  the  weighted  sum  of 
the  average  packet  queue  size  and  the  probability  of  loss 
for  line-switched  calks.  Specifically,  we  choose  to  minimize 

K  =  E,iL)  +  aPl,.  (8) 

By  letting  (he  weight  factor  a  vary  (0  £  a  s  x ).  different 
optimal  pairs  ( £,(  L ).  Pl^ )  may  be  obtained  that  minimize 
g.  It  can  be  shown  |27|  that  these  pairs  correspond  to 
points  on  (he  optimal  average  queue  length  loss  pr«>bahil- 
ity  curve  (the  first  criterion)  as  well.  This  scctuid  criterion 
thus  also  serves  as  a  way  of  computing  some  poinis  on  the 
curve  for  the  first.  By  incorporating  the  previous  constraint 
on  loss  probability  in  the  average  cost  function,  the  prob¬ 
lem  turns  out  to  be  identical  to  a  finiie  state  Markov 
decision  privess  with  average  cost  ininimi/ation.  It  follows 
(22j  that  for  every  irreducible  cla.ss  of  policies  |ff|„  an 


optimal  ptilicv  exists,  and  it  is  determinisiic.  i.e..  ii  has  the 
form  of  a  fixed  assigniiient  of  an  action  to  every  state.  We 
first  consiiler  the  birth  death  approximate  iiioviel  of  Sec 
non  III.  then  take  the  exact  model  with  no  such  approxi¬ 
mation  made. 

A.  The  Birih- Death  Appraximatmi 

Let  us  define  the  one-period  cost  C(X.  S)  as  follows: 
C(X.S)  i  C(L.r.  /<)  i  Z,  +  a(l  -  A). 

{0.  if  ( Z..  r)  is  a  blocking  state. 

I.  if  (Z..  r)  is  an  allowing  state. 


Then  (see  (22.  p.  14 1  j).  the  average  cost  g  will  be  indepen¬ 
dent  of  the  initial  state  ( Z.,).  Zg)  =  and  given  by 

«-**  ..  ft  T  I 


For  every  class  of  policies  ( v},, .  the  transitions  (4)  and  the 
average  expected  cost  (8)  define  a  finite  state  Markov 
decision  process  with  average  cost  minimization  and  valid 
irreducibility  condition.  Therefore,  from  (22.  theorem  6.17} 
we  conclude  that  an  optimal  policy  w  exists  within  every 
class  (v).,,'.  and  it  is  stationary  deterministic.  Equivalently, 
there  exists  a  set  of  bounded  functions  h{X)  =  h(l..  r). 
0  £  r  £  M\  0  £  LS  Q  and  a  ftxed  number  g  such  that 

g  =  A(Z..r) 

=  min  (c(Z,.r.  A) 

4C|0.I)  ( 

+  2  P(L\r'\L.r,A)h(L\r')\.  (9) 

t:.  r-  ’ 


where 


C(L.r.A)  =  LA-  a(l  -  A) 


P(  L\  r-\L.r.A) 

^  lP,{L'\L)P,(rr 

j/>,.,(Z.' I  £)/»„,(/'). 


if  no  arrivals  occur, 
if  one  arrival  occurs. 


Again.  P,{l.'\L)  is  given  by  (3).  while  under  the  birth- 
death  assumption 

probability  of  no  arrivals  =  I  -  X,h. 

probability  of  one  arrival  =  X,6. 

phs.  if  r'  =  j  -  I  (one  departure). 

P.(r')  =  I  -  hs.  if  r’  =  s  (no  departure). 

0.  otherwise. 

The  minimi/ation  of  the  right  side  of  (9)  is  carried  out 
unvicr  two  disimct  options:  a)  the  state  I  /..  r )  blivcks  a  new 
arrival,  or  b)  ii  allows  a  new  arrival  in  the  NV.Mcin.  Thus  |9) 
deftneA  the  deicrminislic  optimal  ptdicy  within  the  cI.isa 
(rj„.  The  global  optimal  policy  mii>l  be  selected  among 
the  optimal  policies  of  .ill  classes,  wnh  X,  ■  p  <  Xf'  *.  .U. 
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HqualKin  (‘^)tan  be  uaa\I  lOA.trrv  <>iii  .1  'O.irAli  .iliioriibin 
for  ihc  dclcriuin.iiion  of  ibc  optim:il  poliA\  1  lio  .il'.s'iiilmi, 
referred  10  as  the  polics  improAcincni  .ileoriilim  (see  |22.  p. 
1501)  slaris  with  an  arhiirars  inid.il  poliev.  soUes  die 
resulting  optimality  equation  for  lnl..i  \  and  e-  and 
searches  for  siate-aeiion  assignment  alternatives  leading  to 
a  smaller  cost  under  the  same  h{l..r).  1  he  improved 
policy  is  set  for  the  new  iteration  until  no  improvement  ean 
be  found.  The  advantage  of  this  algorithm  eonipared  to  the 
linear  programming  approach  is  that  it  searches  only  for  a 
deterministic  optimal  policy,  liquation  t*))  may  he  used  to 
derive  some  properties  of  the  optimal  policy,  m  order  to 
reduce  the  amount  of  searching  and  the  computational 
complexity  of  the  policy  improvement  algorithm. 

Optimality  and  the  Movable  Bouiultirv:  The  optimal 
policy  discussed  above  can  he  represented  as  a  function 
/(f).  where  l{r)  is  the  minimum  I.  for  which  (  L.  r)  is  a 
blocking  state.  In  Fig.  2  the  transition  diagram  for  the 
birth-death  line-switching  traffic  case  is  given.  The  dwell 
time  Tr  (the  number  of  transitions  the  prixess  stays  in 
state  f )  will  have  a  mean  value 

^  pbr  +  X^bP(L  <  l(r))  pMb  +  X,b 

(see  (28)).  It  is.  therefore,  reasonable  to  assume  that  the 
packet  queue  size  is  a  much  faster  changing  process  and 
hence  reaches  equilibrium  within  Tr.  The  factor  by  which 
equilibrium  is  reached  independently  of  the  initial  state 
depends  on  the  eigenvalues  of  the  one  step  transition 
matrix  for  the  queue  state  equation.  It  is  shown  in  (22|  that 
the  process  ‘*shrink.s"  to  its  equilibrium  by  a  factor  equal  to 
the  product  of  the  absolute  values  of  these  eigenvalues,  the 
so^alled  shrinkage  factor.  Typical  values  of  the  shrinkage 
factor  for  this  particular  queue  model  range  from  lO  to 
lO"*". 

Under  the  above  reasoning,  the  expected  queue  length 
will  depend  only  on  the  stationary  probabilities  r’,(r): 

M 

E,{L)  =  1  E{L\r)P,{r). 

r  =0 

with  £(  L/r)  independent  of  w. 

The  Markov  decision  priKess  that  minimizes  the  average 
cost  (8)  under  these  conditions  is  thus  defined  by  the  state 
f^.  decisions  A  =  0. 1.  and  transitions  as  in  Fig.  2.  I  he 
optimal  policy  r  will  again  be  deterministic:  i.e..  the  set  o( 
states  is  partitioned  into  two  subsets  thrv'ugh  a  bvsundary 
M':  if  f  <  M\  a  new  setup  call  is  served;  if  r  >  .V/’.  a  new 
call  is  blocked.  This  is  identical  to  the  movable  bounvtarv 
scheme  described  in  the  introduction.  .V/'  will  depend  on 
the  value  of  a.  the  weight  factor.  Note  that  not  all  movable 
boundary  .schemes  may  correspond  to  an  n-  optimal  scheme. 


X.bPic  •  /  ‘  I 

I  2  Mtflll  Ji'jlh 


I  sing  the  slate  notation  of  Section  ll-C  and  (.S|  for  PI.  in 
|S).  we  formulaic  the  finite  state  Markov  decisuvii  process 
with  transitions  1 1 )-  (4)  and  average  expected  cost  mimmi- 
/at ion  as 

m,n  j/:,(L,)^of:,['  ^-)|  = 


From  (2)  (derived  from  the  memoryless  property  of  the 
exponential  distribution),  we  may  easily  show  that 

=  lP(r,,^\S,)PAS,)=e  '•'•£,(5,). 

.Assuming  irreducibility  with  "n  belonging  to  an  ineducibil- 
itv  cla.ss  (rr)  A, .  the  process  achieves  equilibrium  and  £.(  r ) 
=  e  “'’£,(5)  or 

£.(£)  =  e"'’£,(r).  (II) 

Redefining  the  weight  a  —  (o - A,/>)( e'*'’  -  1)  and 
g  —  g  -  fl  we  have,  from  (10)  and  (11).  as  the  equivalent 
cost  function  to  be  minimized; 

g  =  min  {£,(£)- o£,(r)}.  a  >  0.  (12) 

ri.  (ff)„ 

Equ  ations  ( 1  )-(4)  and  the  ciist  function  (12)  guarantee  the 
existence  and  deterministic  nature  of  the  optimal  policies 
within  the  irreducibility  classes  (tr)*, .  These  policies  assign 
the  number  of  line-switched  slots  in  a  frame  S  such  that 
/•  s  S  s  min(r  +  e.  M').  based  on  the  state  X  -  {L.r.  i-); 
X  -S. 

The  average  expected  cost  in  (12)  suggests  that  the 
one-period  cost  C(  X.  S )  is 

C(X.S)  -  C(L.r)  =  L  -  ar.  (13) 

The  optimality  equations  can  he  simplified  by  reducing 
the  three-dimensional  state  space  to  a  two-dimen'ional 
A>ne.  .After  some  algebra,  it  is  concludi’d  that  the  optim.iliiv 
equations  are  defined  over  the  set  of  bounded  functions 
li'i  L.r)  satisfying 


CMAX 

X  r-  A'(  /..  r  )  =  /.  -  or  -t-  £( "  ) 

t  II 

I  y  V 

■  min  S  2;  />,(r')/>.(£'|£)/i'(/.'.r-) 

1  '•  s-  I 1  '  "  '  '• 


(14) 


B.  The  E.xuct  Mode! 

We  now  priKced  to  the  exact  miHlel  anaivsis  under  the 
second  optimality  criterion.  Since  the  number  of  arriv.iK  or 
departures  is  not  rcstncicd  to  one.  the  resiilis  of  Sccti.m 
(V-A  under  the  birth  dc.iih  assumption  arc  iioi  applicahk 


with  0  -:  /.  (>.  0  ^  r  -  .V.  and  0  <  ('MAX  (Rec.ill 

that  CM  A.X  is  the  maximum  number  of  c.ills  per  framo  ui 
the  truncated  Poisson  .irriv.il  pnH.css  )  Prob.ibilines  f’lc  ). 
/’(r  ).  /’,(r  ),  /’_(/.'|/.)  arc  given  bv  (I).  (2i.  and  i.'l. 
rcspeetoelv  The  policv  impr>ivemeni  algoritlim  ilelmed  bv 
( 14)  follows. 


I 

Let  H{S.  1.)  vlcmuc  the  I'unetuMt  t\>  he  ittniiiut/et.!  in 
(14): 

//(S.Z.)=  S  2  /’.('■■)/*.(  L' I /.)//•(/.•./■).  (13) 

I  ■  It  f  It 

Within  the  irreducihility  cLins  { ?  j  „  .  we  denote  a  delcrniin- 
isiic  policy  by  the  Male-action  niappiiig  .(„(  L.  /•.  f )  -  .V. 

The  algorithm  is  then  formulated  as  follows:  for  every 
class  of  policies  {w)„  .  A, .  fi  <  A/'  s  M. 

Step  I:  Initialize  to  the  movable  boundary  policy: 
A/)f  ( L.  r.  f)  =  min(r  +  $>.  A/'). 

Step  2:  Solve  the  linear  system  for  p  and  h'{  L.  /■): 

g  h'{L,  r)  =  L  -  ar 

CMAX  V  v 

+  1  1  1  /‘,{r-)P,(L‘\LmL'.r'). 

»-0 

V(Z..r), 

with  S  =  A^iiL.  r.  »).  h'{0.0)  =  0. 

Step  3:  Compute  //( S.  Z. )  of  ( 1 5 )  for  0  S  S  ^  A/'.  0  < 
L^Q. 

Step  4:  For  every  {L.r.e).  »  >  1.  r<M'.  find  the 
integer  S  in  the  interval  r  S  S  ^  min  {r  +  e.  A/'}  which 
minimizes  H{S.  L).  Denote  this  S  by  .4\,  (  L.  r.  *'). 

Step  5:  If  A\,iL.  r.  p)  A^,  (L.  r.  p)  for  some  states 
( L,  r,  p).  sci  A  «•(  L.  r.  r )  —  A\,\  L.  r.p)  and  go  to  2.  If 
not,  stop.  Select  the  policy  A^,iL.r.p)  resulting  in  the 
minimum  g  for  all  M'.  A,/>  <  A/'  s  A/. 

The  optimal  policy,  in  this  general  case,  does  not  neces¬ 
sarily  have  a  movable  boundary  format  as  it  did  for  the 
birth-death  approximation.  It  has  been  found  in  all  the 
examples  we  have  worked  out  that  H(S.  L)  defined  by  ( 15) 
is  a  convex  function  of  .S  for  all  0  <  L  s  (>.  achieving  a 
minimum  S„,„{  L )  within  0  <  S  <  M'  (.see  Fig.  3).  Then  the 
optimal  policy  will  assign  to  every  state  {L.  r,p]  the  action 
S  with  S  the  closest  integer  to  S,„,„{L)  satisfying  the 
condition  r  S  S  ^  min  {r  p.  M').  or 

(5„.„(Z.).  ifr<S„JZ.)<r-t- I-. 

(iL.r.*-)ls=  r.  ifr>S,„jZ.). 

r+p.  i{r  +  p<S„,jL). 

The  seemingly  complex  policy  thus  reduces  for  convex 
H{S.  L)  to  a  simple  one  that  depends  on  the  packet  queue 
size  L.  The  other  components  of  the  state  vector  X  = 
{L.r.p)  are  taken  into  account  in  simple  fixed  point 
comparisons  and  additions.  The  multiplexer  needs  to  main¬ 
tain  in  its  ItKal  storage  only  the  table  S„,  „(  L )  of  size 
0+1. 

We  have  not  been  able  to  prove  formallv  the  convexity 
of  H{S.  /. ).  However,  the  intuitive  understanding  of  the 
optimal  policy  format  aiul  our  compiitation;il  experience 
suggest  that,  for  all  pr.ictical  purpoNCs.  the  st.iteiiient  above 
is  valid. 
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A.  Birth-Death  Appro.umanaii 

Two  numerical  examples  were  studied  under  this  ap¬ 
proximation.  The  linear  programming  and  policy  improve¬ 
ment  algorithms  were  implemented  on  a  computer.  The 
policy  improvement  algorithm  led  to  movable  boundarv 
schemes  as  expected,  within  at  most  three  iterations.  Lme.ir 
programming  was  applied  only  to  the  first  example  (con¬ 
sisting  of  126  variables  and  66  constraints)  since  the  si/.e  of 
the  second  example  (3K6  variables  and  147  constraints)  did 
not  permit  convergence  of  the  mivdified  simplex  method 
used.  The  two  examples  follow. 

Case  /;  The  following  parameters  were  used  in  this  case: 

A,  =2.0  messages/s.  l/;t  =  I  s.  p,  =  A,//t  =  2. 

A,  =  2000  packets/s.  Pj  =  A_./>  =  2. 

/»  =  0.001  s.  A/ =  6  slois/frame.  0  =  8  packets. 

In  Fig.  4  we  have  plotted  the  resultant  loss  probability 
versus  the  average  queue  length.  Points  resulting  from 
{£■(/.)  +  aPI)  minimizations  (small  circles)  belong  to  the 
optimal  curve  {min  £(  L).  PI  <  K  }  extrapolated  from  the 
linear  programming  results  (black  circles).  The  former 
points  are  achieved  through  movable  boundary  policies 
( .A/’ =  3.4. 6).  whereas  the  latter  correspond  to  nonde- 
terministic  schemes.  We  include  the  movable  hounda'-y 
policy  with  M'  =  5.  which  does  not  belong  to  the  v»ptimal 
curve.  It  is.  however,  very  close  to  that  curve,  offering 
excellent  (suboptimal)  performance.  In  order  to  check  the 
validity  of  the  reasoning  in  Section  IV-.A  concerning  the 
fast  evolution  of  the  packet  prvvcevs  compared  to  the  line- 
switching  slot  number  variations,  the  shrinkage  factor  for 
transition  matrix  P^ll.\l.')  was  evaluated.  It  was  found  to 
range  from  0.26  x  10  for  S  =-  I  up  to  0.22  ■  10  '■  lor 
A'  =  6.  The  values  of  this  factor  justify  the  fast  evolution 
assumption,  .^s  a  further  check,  the  statuvnary  condituui.il 
expectations  tA  t.  .V)  were  used  to  compute  £(  L)  and  /’/ 
f«vr  the  movable  b«vundary  schemes  with  M'  -  3.4. 5.6. 
The  steady-state  probabilities  PjS)  were  assumed  to  be 
given  by  the  Frlang-fl  probabilities  .A  comparison  with  the 
policv  imprvivement  cv.ict  rcMiltv  vhowed  that  the  two 
meiluHls  agreed  within  two  ilecim.il  iligits 

(‘iivi’  \Ae  repe.iled  the  ciilciil.iiion'  .ibove  (except  li>r 
the  linear  progr.immmg  approach)  f»»r  a  larger  Nt.ite-sp.ice 
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example; 

\i  =  4.0  messagcs/s,  \/n  =  I  s,  p,  =  \,/n  =  4. 

\j-  4000  packets/s.  p,  =  A,6  =  4. 

b  =  0.001  s.  M=  10,  Q=  \1  packets. 

Results  are  plotted  in  Fig.  5.  Again,  all  movable  boundur> 
policies  either  belong  to  the  optimal  curve  or  are  quite 
close  to  it.  The  validity  of  the  fast  evolution  assumption 
was  checked  up  to  two  decimal  digits. 

As  a  conclusion,  under  the  birth-death  assumption  the 
movable  boundary  schemes,  which  turn  out  to  be  quite 
dense  in  the  region  of  interest  (  PI  <  0.05).  offer  optimal  or 
near-optimal  performance. 

B.  The  Exact  Minlel 

The  policy  improvement  algorithm  of  Section  IV-B  was 
implemented  for  the  general  case  (the  number  of  arrivals 
and  departures  are  permitted  to  have  any  value).  The 
algorithm  is  used  to  determine  the  limitations  of  the 
birth-death  assumption  and  to  derive  the  properties  of  the 
optimal  policy. 

If  The  t'aliJiiv  of  the  Birth -Death  Assumption:  We  ap¬ 
plied  the  algorithm  to  the  example  of  case  I  in  Section  V-.A 
with  the  parameter  a  =  .1.  leading  to  an  optimal  boundary 
scheme  with  M'  =  4.  The  exact  optimi/ation  was  earned 
out  by  varying  the  frame  length  h  from  ti.OOl  0.1.  It 
turns  out  that  the  general  policy  impfovcmeni  algonihn) 
leads  to  the  same  optimal  schemes  (movable  boundary  with 
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M'  =  4)  and  performance  measures  as  the  simplified  one 
under  the  birth-death  assumption,  with  values  of  h  less 
than  0.01.  For  larger  frame  durations  (lower  speed  trunks) 
(he  optimal  deterministic  schemes  are  different  from  the 
movable  boundary  ones,  having  the  table  look-up  form 
described  in  Section  IV-B. 

2)  Policy  Improvement.  Case  J.  The  following  parame¬ 
ters  were  selected  for  the  application  of  the  general  policy 
improvement  algorithm: 

A I  =  0.6  messages/ s.  \/p  =  3.33  s.  P|  =  2. 

As  =  2.0  packets/s.  p>  =  2. 

6  =  1  s.  M  =  6  slots/frame.  P  =  8  packets. 

This  case  ha.s  the  average  message  length  three  times  the 
frame  size,  hence  the  birth-death  assumption  is  not  valid 
Optimal  deterministic  policies  were  computed  for  different 
values  of  a.  The  corre.spondmg  average  queue  length  £(  i. ) 
and  probability  of  loss  PI  have  been  plotted  in  Fig.  6.  For 
comparison,  movable  boundary  schemes  were  evaluated  .is 
well  (.small  triangles  in  the  figure),  .^s  expected.  /■'(/, )  is 
increasing,  and  PI  is  decreasing  with  the  weight  n.  L  nlike 
the  birth- death  case,  the  optimal  points  found  by  varvmg 
a  form  a  rather  dense  or  closely  spaced  set.  Hence  there  is 
no  need  for  the  complicated  linear  program  to  determine 
the  optimal  curve.  For  a  given  level  of  loss  prob.ibiliiv.  a 
deterministic  scheme  or.  equivalently,  a  weight  n  car.  be 
found  minimi/mg  the  average  queue  length.  This  flexibilnv 
in  adju.sting  the  multiplexer  performance  to  an  optimal 
combination  of  line-switching  probabiliiy  of  loss  and 
packet-switching  time  delay  is  the  mam  advantage  of  the 
method 

File  optimal  policies  were  found  to  have  the  form  vlc- 
scribevl  in  Seciion  l\'.-U.  i.c..  foi  every  /..  the  lunnber  ol 
packels  m  queue.  .i  number  .S',,,,,,)/.)  exists  such  that  the 
opiinial  policy  assigns  .iciions  0  ■'  .V  •  M'  to  st.iie  t  /..  r.  v  ) 
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as  foUows: 


S  = 


r, 

r  +  r. 


<  r  ^  i-. 

if  S„jL)<r. 
ifr  +  »<S„JL). 


As  previously,  r  is  ihe  number  of  linc-swiichcd  sessions 
requiring  continuation  of  service.  •>  represents  the  new 
setup  line-switched  calls  per  frame,  and  5  is  the  number  ot 
slots  allocated  to  the  line-switched  traffic.  In  Table  I.  we 
show  S„,„(  L )  as  found  for  the  values  of  a  used  above.  As 
expected,  S„,„(L)  is.  in  general,  increasing  with  a  (i.e.. 
more  emphasis  is  given  to  the  probability  of  loss)  and 
decreasing  with  L.  Note  that  in  the  case  of  a  2:  3.25  and 
L>  5,  this  pattern  is  violated  (see  lower  right  part  of  Table 
1).  This  IS  mainly  due  to  the  finite  queue  length  assump¬ 
tion.  which  imposes  boundary  conditions  on  the  optimiza¬ 
tion  problem.  Note  that  the  packet  queue  overflow  proba¬ 
bilities  did  not  exceed  S  percent  for  the  selected  queue  size 
G  =  8. 

Comparing  with  the  movable  boundary  policy,  we  note 
that  the  improvement  in  the  e.xpected  queue  length  vari¬ 
ance  provided  by  the  optimal  policy  is  not  significant  over 
the  movable  boundary  scheme  for  comparable  values  of 
loss  probability.  (Note  that  the  expected  queue  length 
coordinates  in  Fig.  6  are  greatly  expanded.) 

In  order  to  further  illustrate  the  comparative  perfor¬ 
mance  of  the  optimal  and  the  movable  boundary  schemes, 
we  evaluated  their  performance  under  vari.ition>  of  the 
input  line-switched  traffK-  rate  X,.  We  constrained  ihe 
line-switching  loss  probability  PI  to  a  maximum  acceptable 
level  of  2.8  percent  and  computed  the  corresponding  per¬ 
formance  of  the  optimal  and  movable  boundarv  schemes. 
Every  optimal  policy  search  involved  several  runs  l»>r  the 
determination  of  the  weight  providing  a  loss  probability 
closest  to  the  2.8-pcrcent  figure.  For  comparison,  the  per¬ 
formance  of  the  fixed  boundarv  .scheme  (see  |20|)  was 
evaluated  as  well.  This  scheme  allvKales  iwo  predeierimned 
portions  of  the  frame  to  the  two  traffic  categories  wnhoui 
permitting  the  packet  traffic  to  cKcupy  any  idle  sUiis  m  the 
line-switched  poriion. 

We  further  assumed  that  signaling  p.ickeis.  necessarv  h'r 
the  control  of  the  line-sw itched  sessions  (path  setup,  map¬ 
ping  of  line-switched  slots,  termination  of  calls,  etc.)  are 


transmitted  as  onlinarv  (lackeis;  thus  tliev  intri>i!ui.e  o\er- 
head  in  the  p.icket  ir.illic  kke  approviiiiaied  this  oveilic.id 
Using  the  incihiMl  described  in  1 1 1.  Specilic.ilK.  we  .issuined 
that  every  Ime-sw itched  sessuin  introduces  a  p.ickei  over¬ 
head  equal  to  X'(/\,  ft)  with  K  a  parameter  depeiuhng  on 
the  signaling  protocavl.  The  packet  input  rate  is  then  given 
by  X.  =  X',  +  X(X|  ft),  with  X',  the  input  rate  of  the  data 
packets.  We  did  not  explicitly  consider  this  overhead  in  the 
example  of  ca.se  3.  since  the  values  of  X,  and  X.  were 
assumed  fixed.  Here,  the  dependence  of  X ,  on  van.iiions  of 
X,  must  be  measured  since  we  let  X,  vary.  We  chose 
K  =  (^  3  for  the  particular  calculations,  a  value  used  in  [I]. 

In  Fig.  7  we  demonstrate  the  performance  of  these 
schemes  (optimal,  movable,  and  fixed  boundary)  with  the 
line-switched  traffic  rate  X,  varying.  It  can  be  seen  that  ihe 
movable  boundary  schemes  result  in  almost  optimal  packet 
queue  length  Fig.  7(a)  with  probability  of  loss  adjusted 
below  2.8  percent.  In  Fig.  7(b)  the  llexibility  of  the  optimal 
scheme  in  reaching  a  given  level  of  loss  probability  is 
demonstrated.  The  packet  input  rate  X.  variation  with  X, 
due  to  the  signaling  overhead  is  shown  in  Fig.  7(c).  This 
variation  introduces  the  early  queue  overflow  of  the  fixed 
bounda.'v  scheme  in  Fig.  7(a). 

Similar  results  are  found  with  the  variation  of  the  packet 
input  rate  X>.  while  keeping  the  line-switched  traffic  rate 
X,  fixed.  X|  =0.6  me.ssages/s  was  chosen  as  the  fixed 
parameter  in  the  calculations  here.  These  results  appear  in 
Fig.  8. 

Finally,  a  .sensitivity  analysis  for  small  variations  of  X, 
around  the  reference  point  X,  =  0.6  me.ssages  s  and  X ,  =  2 
packets  s  of  case  3  showed  no  significant  difference  be¬ 
tween  the  optimal  policy  with  a  ~  2  (.see  Fig.  6)  and  the 
movable  boundary  scheme  with  M'  =  4.  These  results  are 
plotted  in  Fig.  9.  Note  that  the  two  sets  of  curves  are  both 
approximately  linear  about  the  initial  operating  point  and 
track  one  another  rather  closely  over  the  range  of  variation 
of  X,  shown. 


VI.  CO.SCLUSION 

The  problem  of  dynamically  alliKating  the  bandwidth  of 
a  trunk  to  both  line-switched  .ind  packet-switched  digital 
traffic  has  been  formulated  in  this  paper  as  a  Markov 
decision  privcess.  Line  switching  was  mixleled  as  a  time 
division  multiplexing  loss  scheme  using  a  varying  portnvn 
v>f  a  fixed  lime  frame.  Packet-switching  traffic  is  served  bv 
the  remaining  portion  of  the  frame  and  requires  queueing 
at  the  multiplexer-ct'ncentraior. 

Two  different  cost  criteria  were  examined  involving 
probability  of  loss  for  line-swuching  and  average  queueing 
delav  for  packets.  The  corresponding  optimization  prob¬ 
lems  were  presented  under  reasonable  simplifying  assump¬ 
tions. 

For  tlie  birth  death  approximate  model,  valid  for  high¬ 
speed  trunks  serving  sufficientlv  lengths  hne-svMicheil  sC'- 
sions  (thus  requiring  mans  fr.imes  for  tr.insmisMon).  ihe 
mvvvable  boundary  scheme  was  fouiul  to  provide  opiim.d 
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points  in  the  loss  prohjhilily  jveruge  pjckel  dclny  plane. 
The  optimal  points,  falling  on  the  curve  eonneciing  the 
movable  boundary  points,  may  be  aehiesed  bv  more  com¬ 
plicated  prid  even  nondeterminisiic  schemes.  TImse  are. 
however,  difficult  to  define  and  to  implement. 

Fur  larger  values  of  the  frame  duration  (or  shorter 
time-switched  sessions!,  the  birth  death  assumpiiixn  is  m>i 
valid  and  optimal  deiernnmstic  schemes  found  are  differ¬ 
ent  from  the  mosable  boiindais  ones.  An  oplim.il  Jt  it  nnin- 
ixiic  policy  can  be  defined  in  mder  to  mmimi/e  the  packet 
delay  with  the  loss  probabilits  close  enough  to  a  given 
value.  The  structure  of  such  policies  is  simple  and  ni.iv  be 
easily  implementevi  for  an  iniellige!'.l  muliipicver  with  the 


capability  of  table  look-up.  The  length  of  these  tables 
equals  the  packet  queue  length. 

It  is.  however,  possible  to  achieve  acceptable  levels  of 
loss  probability  and  near-optimal  packet  delav  vsnh  mov¬ 
able  boundary  schemes.  We  therefore  suggest  the  u.se  of  a 
movable  boundary  technique  for  those  integrated  sv stems 
requiring  a  fixed  frame.  It  offers  excellent  perform.incc  .md 
simplicity  of  implementation.  We  have  thus  shown  th.ii  the 
movable  boundarv  technique,  introdiacd  some  ve.irs  .igo 
in  an  «»</  hn  manner  as  a  simple  scheme  for  imptovmg  tlie 
cap.icity  ulih/.alion  of  the  fixed  boundarv  privedure.  does 
m  fact  display  near-optimum  properties  m  the  sense  dis¬ 
cussed  in  this  p.iper. 
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Appendix 

Irreoucibilitv  Condition 

In  order  to  proceed  in  determining  the  optimal  policy,  we  need 
to  specify  whether  the  proces.s  exhibits  the  irreJuahitin  amJiiwn. 
This  condition  (22j  states  that  under  any  deitrmiiuatu  policy 
(assigning  an  action  to  a  state  with  probability  one  regardless  of 
the  past  hi.siory  of  the  process),  the  resulting  discrete-time  Markov 
process  consists  of  one  class  of  communicating  states.  In  our 
case,  there  are  policies  which  do  not  allow  the  number  of  line- 
switched  (LS)  sessions  per  frame  to  exceed  a  given  number 
M'  <  M.  (For  example,  the  simple  movable  boundary  policy 
described  in  Section  I  does  not  allow  the  LS  traffic  to  occupy  the 
whole  frame.)  Under  these  policies,  states  X,. ,  with  .W  <  r,. ,  < 
M  may  not  be  reached  from  state  X,  with  0  5  r,  £  A/.  In  order  to 
preserve  the  irrcducibility  condition,  we  therefore  need  to  define 
the  following  classes  of  policies  ( referred  to  hereafter  as  “irrcduci- 
bility  classes”)  based  on  the  different  barriers  imposed  on  the 
line-switched  slot  numbers  per  frame;  the  set  (ir)  of  all  policies  is 
divided  into  the  mutually  e.xclusivc  subsets  or  irrcducibility  classes 
{ir)„  .  0  <  A/'  £  Af.  such  that  any  policy  belonging  to  the  class 
!«)„  mav  assign  actions  achieving  but  never  exceeding  M' 
(S,  S  A/') 

If  we  limit  the  search  for  an  optimal  policy  within  a  particular 
class  {«)i, .  the  state  action  space  is  reduced  to  0  r,  £  M'  and 
0  S,  £  V.  Obviously,  any  deterministic  poliev  belonging  to 
|ir}„  results  in  an  irreducible  Markov  chain.  (I)  permits  i-.  to 
take  any  value  up  to  CMAX;  (2)  lets  r,  take  any  value  up  to  S, 
with  S,  <  M'.  and  (3)  from  M/G/S  queue  analysis  defines  an 
irreducible  process  with  0  /.,  r-  Q  for  anv  S,  pattern  as  long  as 

the  frame  format  can  accommodate  the  average  traffic  require¬ 
ment  The  latter  condition  leads  to  lower  bounds  for  Af'  and  A/ 
M'  must  be  greater  than  the  I.S  traffic  intensitv  and  \f  ereaier 
than  the  total  traffic  intensitv  (line  switchinc  plus  packet  switch¬ 
ing).  Thus  A,/u  •  A/'  ■  A/  and  X,  .  fi  -  X  ./'  •  Af  This  reason¬ 
ing  suggests  that  the  irreducibilitv  condition  hi'Ids  within  an 
irreducihiliiy  class  of  policies  and  for  the  accordingly  reduced 
Slate  .space 
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ABSTRACT 


Various  satellite  access  alternatives  are  studied  in  terms  of  accommodating 
packetized  traffic  mix  including  a  tight  delay  constrainted  component.  The  time  critical 
traffic  delay  due  to  contention  and  fram.e  latency  is  assumed  to  be,  at  most,  50  msec  with 
959^  confidence.  Two  major  access  categories  are  studied.  Under  the  first,  t^e  time 
critical  traffic  is  served  in  a  dedicated  subchannel,  either  via  slotted  ALOKA  random 
access,  or  via  TDM  slots.  As  a  result,  the  non-time  critical  bulk  traffic  is  decoupled  from 
the  stringent  delay  constraint  and  may  use  more  efficient  demand  assignment  technique. 
Under  the  second,  a  fixed  TDVA  frame  is  partitioned  into  fixed  portions.  Each  transmitting 
earth  station  fills  its  dedicated  portion,  giving  priority  to  the  time  critical  traffic.  The 
partition  should  be  able  to  accommodate  the  anticipated  time  critical  traffic  within  the 
given  specifications. 

Analysis  of  the  above  schemes  is  being  carried  using  slotted  ALOFA,  M/D/l,  and 
V/D/N  aueueing  models.  It  is  shown  that  the  slotted  ALOHA  alternative  provides  a  ven’ 
poor  candidate.  Among  dedicated  Tn\7  subchannels  and  oriofity  TD^’A  alternatives,  the 
latter  is  shown  to  provide  a  simple  and  robust  scheme,  especially  suitable  in  case  cf 
relatively  fixed  traffic  proportionality  among  the  accessing  earth  stations. 
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I.  INTRODUCTION 
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Satellite  channels  have  been  widely  recognized  as  an  attractive  alternative  to  lone- 
haul  voice  and  data  communications.  Specifically,  the  development  of  high  speed  digital 
transmission  hardware  and  high  speed  burst  modems  permits  the  transmission  of  digital  data 
at  rates  above  50  Mbps.  Broadcast  geostationary  satellites  have  been  used  for  the  transfer 
of  packetized  data  among  a  population  of  users  with  appropriate  earth  station  (ES) 
equipment.  The  corresponding  protocols  for  satellite  packet  network,  e.g.,  satellite  access, 
transponder  capacity  sharing  and  error  control  have  been  developed,  based  on  the  satellite 
channel  salient  characteristics:  Broadcast  nature,  250-270  msec  one-hop  propagation  delay, 
multiple  access  (shared  use  of  the  satellite  transponder  capacity),  and  Gaussian  noise 
statistics.  Frequency-division  multiple  access  (FDMA)  techniques  have  been  traditionally 
associated  with  voice  satellite  com.munications.  Recently,  however,  time-division  multiple 
access  (TDM A)  has  been  suggested  as  a  more  flexible  and  efficient  technique  for  both 
digitized  voice  and  packet  data.  In  TDMA,  U^e  data  from  each  earth  station  are  segmented 
into  bursts  of  transmissions  which  are  timed  and  interleaved  within  a  time  frame.  Initial 
acquisition  and  svnchronization  are  achieved  via  frame  end  burst  overhead  sx'mbols. 

Depending  on  the  flexibility  of  a  TD^’A  scheme  to  adapt  itself  to  traffic  variations, 
the  following  three  major  classes  are  defined  in  I  1) ;  Dedicated  uplink/dedicated,  downlink 
(Fixed)  TDM  A  with  fixed  data-rate  point-to-point  links  connecting  every  possible  source 
destination  pair,  dedicated  uplink/shared  downlink  having  a  fixed  length  burst  assignment 
per  source,  dynamica.Uv  filled  with  various  destination  data,  and  shared  uplink/shared 
downlink  with  the  source  burst  assignment  dynamically  varying  according  to  the  accessing 
earth  station  traffic  requirements.  From  the  above  approaches,  the  first  is  unnecessarily 
rigid  and  wasteful  for  a  packet  data  network.  The  second  and  third  provide  viable 
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alternatives  depending  on  the  traffic  burstiness  and  delay /throuehput  reouirements.  For  a 
bursty  interactive  packet  environment,  demand  assignment  techniques  have  been  developed 
from  the  early  satellite  communication  phases.  These  techniques,  which  are  associated  with 
the  shared  uplink/shared  downlink  class,  range  from  random  access  pure  and  slotted  ALOHA 
[2],  [3]  to  implicit  [4],  and  explicit  reservation  t  5],  16]  techniques.  The  latter  schemes 
introduce  centralized  or  distributed  control  mechanisms  in  order  to  resolve  schedulinq- 
conflicts  among  the  accessing  units,  thus  increasing  the  system  throughput  at  the  cost  of 
additional  delay  and  implementation  complexity. 

A  natural  extension  to  the  application  of  packet  satellite  network  for  interactive 
traffic  has  been  the  us**  of  the  satellite  large  bandwidth  resources  and  broadcast  natime  to 
support  multiple  purpose  integrated  digital  communications.  These  networks  should  he  a^le 
to  provide  timely  and  reliable  data  communications  to  a  variety  of  users  end  applications.  It 
is,  therefore,  necessary  to  devLse  priority  access  schemes  in  order  to  satisfy  performance 
requirements  for  traffic  categories  ranging  from  interactive  packet  delivery,  requiring'  lee? 
than  half  a  second  end-to-en'^  delay,  to  message-sv^itching  electronic  document  distribution, 
overnight  large  file  transfer,  and  voice/video  communications.  The  POPA  protocol 
described  in  [  1  ] ,  is  a  tyo'cal  scheme  aiming  »t  dvnemicell'’  accommodating  such  a  diversity 
of  applications. 

The  main  issue  associated  with  a  general  purpose  integrated  satellite  packet  network 
is  the  handling  of  the  time  critical  applications.  In  order  to  provide  comparable  service  to 
terrestrial  value-adding  networks,  the  end-to-end  delev  should  be  of  the  order  of  300-3A0 
msec  including  access  queueing  delay  and  latency  end  the  250-270  msec  o'^opagation  delnv. 
Obviously,  no  explicit  reservation  scheme  can  fulfill  the  above  requirement  since  it 
introduces  an  additional  one-hop  reservation  delay.  A  direct  access  scheme  should, 
therefore,  be  evaluated,  such  that  frame  latency,  queueing  and/or  retransmission  statistics 


do  not  violate  the  given  specifications.  The  stringent  time  critical  delay  requirement  is 
expected  to  result  in  some  inefficiency  in  both  the  time  critical  and  the  non-time  critical 
(bulk)  traffic  capacity  utilization.  It  is  the  ourpose  of  this  paper  to  comparatively  evaluate 
various  feasible  access  methods  and  determine  their  design  parameters  as  a  function  of  the 
packet  rate  of  the  time  critical  traffic.  Two  general  access  categories  are  studied,  as 
shov.’n  in  Figure  1.  Under  the  first,  the  time  critical  traffic  access  a  dedicated  subchannel 
either  via  slotted  ALOHA  or  via  time-division  multiplexed  slots  dedicated  tc  every 

earth  station.  As  a  result,  the  bulk  traffic  is  decoupled  from  the  very  stringent  delay 
requirement  and  may  use  an  appropriate  dynamic  assignment  technique.  Under  the  second 
category,  a  TDMA  frame  is  partitioned  into  segments  or  slots,  preassigned  to  the  accessing 
earth  stations.  The  slot  sizes  depend  on  the  relative  traffic  intensities  of  the  earth  stations 
and  are  updated  periodically  with  the  period  spanning  several  frames.  Each  transmitting 
earth  station  fills  its  dedicated  slot,  giving  priority  to  the  time  critical  traffic.  An  Vl/D/y 
queueing  model  is  used  to  define  t^^e  minimum  slot  size  such  that  to  guarantee  proper 
delivery  of  the  time  critical  oackets.  For  a  description  of  this  priority  TUMA  protocol  see 
[7]. 

The  models  used  for  the  performance  evaluation  of  the  above  strategies  assume  a 
finite  population  of  earth  stations  generating  fixed-size  time  cal  oackets  v^ith  Poisson 
statistics.  The  packet  arrival  rate  ^  is  identical  for  aU  ea-lh  stations.  Note,  that  the 
Poisson  assumption  is  justified  by  the  realistic  model  of  a  large  terminal  pooulaticn 
clustered  around  a  particular  earth  station.  The  tolerable  time  critical  access  delay  is 
assumed  throughout  this  paper  to  be  50  msec  within  a  95‘^c  confidence  interval.  'F'  is 
criterion  provides  a  more  meaningful  performance  measure  than  the  average  delay,  despite 
the  analytical  complexity  it  involves.  Whenever  analytically  inaccessible,  the  95^^ 
percentile  is  estimated  from  first  and  second  moments.  A  discrete  event  simulation  is  used 


to  check  the  validity  of  the  above  estimation.  The  50  msec  access  delay  specification  is  a 
reasonable  choice  resulting  approximately  in  300-350  msec  end-to-end  delay  including 
processing  and  oropagation. 
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n.  DEDICATED  TIME  CRITICAL  SUBCHANNEL  STRATEGIES 


In  this  section,  we  evaluate  the  two  methods  for  handling  the  time  critical  traffic  in  a 
separate  dedicated  subchannel.  Since  these  methods  decouple  the  time  critical  and  non¬ 
time  critical  components  of  the  traffic,  their  adequacy  and  ability  to  meet  the  delav 
constraints  are  prerequisites  for  the  use  of  demand  assignment  reservation  schemes  to 
handle  the  bulk  components. 

A.  Slotted  ALOHA  Aeces  Analysis 

A  slotted  ALOHA  subchannel  can  be  used  to  provide  direct  random  access  to  time 
critical  interactive  (lA)  packets.  The  250-270  msec  one-hop  propagation  time  results  in  a 
500-550  msec  delay  to  receive  an  ACK  or  detect  a  collision.  In  case  of  collisions  and 
retransmissions,  the  time  critical  end-to-end  delay  is  violated  by  far.  Hence,  the  only  viable 
solution  is  to  keep  the  collision  probabilitv  below  5®f,  pDowinc  the  95*^^  of  the  M  ppcket*:  to 
experience  only  the  frame  latency  assumed  less  than  50  msec.  (By  frame  latency,  we  mjean 
the  delay  experienced  by  a  packet  from  its  arrival  to  the  time  it  mav  be  considered  fo^ 
service.) 

The  slotted  ALOHA  subchannel  consists  of  slots  of  length  L  (in  Kbits),  T  sec  apart 
shown  in  Figure  2.  The  time  critical  packets  of  f’xed  length  L,  are  generated  in  the  5' 
transmitting  ES's  with  Poisson  rate  >,  packets/sec  and  contend  for  the  allocated  slots. 
Collisions  detected  after  500  msec  arc  handled  by  retransmittine  the  collided  packets  in 
future  slots  selected  at  random  over  an  infinite  interval.  Following  the  analysis  of  the  finite 
population  model  in  [2],  we  have  that  the  average  number  of  retransmissions  per  packet  r 
is  given  by, 
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where,  S  =  M*A*  T  and  G  is  given  by  the  nonlinear  eauation, 

S  =  G(1 (2) 

The  probability  of  collision  for  small  values  of  E  will  be  approximately  equal  to  E, 

E  =  0  •P(0)  +  l-P(l)  +  •••  ~  Pfl) 

To  evaluate  the  oerformance  of  the  system,  we  must  compute  the  maximum  repetitior 
interval  T  for  which  E  5%  under  various  arrival  rates  X.  Obviously,  more  frequent 
repetition  of  the  dedicated  slots  (small  T)  results  in  putting  aside  a  greater  portion  of  the 
channel  bandwidth  for  the  time  critical  traffic,  more  fragmentation  of  the  frame  format 
and,  hence,  deterioration  of  the  system  performance. 

E.  TDM  Subchannel  Analysis 

The  model  assumes  that  slots  of  length  L,  dedicated  to  every  accessing  ES,  are 
separated  by  T  sec  as  in  Figure  3.  The  repetition  period  should  again  be  le&s  or  equal  to  50 
msec  to  account  for  the  frame  latency.  The  time  critical  packets  of  length  L  arrive  with 
Poisson  rate  Xand  wait  on  a  FIFO  (First-In,  First-Out)  single  server  queue  at  the  source  F,?. 
Service  of  outstanding  packets  occurs  every  T  sec  via  the  dedicated  slot.  An  ^VD/l  analysis 
is  performed  to  compute  the  first  and  second  moments  of  the  access  delay.  (For  a  similar 
treatment,  see  [  81 .)  We  assume  that  the  total  delay  D,  consists  of  the  sum  of  two 


independent  random  variables:  Tbe  uniforniiy  distributed  latency  S.,  0  <  S.  <  T  and  the  V/D/i 

2 

queueing  \V.  The  mean  Dand  variance  are  then  given  by, 


^  2(1  -  AT)  2 


2  _  T^  r  A»  T^  1  A»  q 

°D  '  12  ^  [2(1  -  At)  J  3(1  - 


Let  denote  the  95  percentile  of  the  total  access  delay.  VVe  define  the  spread  factor  SF 
as  the  factor  which,  when  multiplied  by  the  standard  deviation,  determines  the  95'’:. 
confidence  interval;  namely. 


D  +  SF  •  0^ 


Various  estimates  can  be  considered  for  the  value  of  the  SF  in  (5).  Using  Chebyshev's 
inequality,  it  turns  out  that, 


Pr  (n  >  f)  +  SF  •  0^)  =  5°'  <  ■  - 

"  (SF)- 


SF  <  4.5 


This  is  a  loose  upper  bound  on  SF.  Other  relaxed  ad-hoc  choices  may  be  SF  =  3,  and  pp  = 
1.9fi  corre^ondine  to  the  normal  distribution.  A  more  reasonable  choice  can  be  found  using 
the  following  argument:  If  T  =  50  msec,  the  probability  of  meeting  the  (n  50  msecf 
const»’ajnt  is  almost  eo.ual  to  the  orobabilitv  of  an  arrival  in  an  empty  queue  (the  access 
delay  consists  of  latency  onlv).  This  probability  eouals  to  A  T.  In  order  to  make  this 
probability  equal  to  5%,  we  select  A  =  1  pac’cet/sec  with  T  =  50  msec. 


8 


X  =  1  p/sec 
T  =  50  msec 


50  msec 


From  (3),  (4),  ^5)  with  X  =  ]  p/sec  and  T  =  50  msec,  we  conclude  that  SF  =  1.5.  In  Table  1, 
we  present  the  95%  delay  under  X  =  i  packet/sec  and  T  =  50  msec  for  different  SF  choices. 
Our  choice  SF  =  1.5  leads  to  the  most  realistic  value. 

The  M/D/1  95%  analysis  presented  in  this  section  is  used  to  evaluate  the  maximum  I A 
rate  which  does  not  result  in  more  than  50  msec  delay  within  95%  confidence  for  different 
values  of  T.  Again,  small  values  of  T  correspnd  to  larger  percentages  of  the  total  channel 
capacity  dedicated  to  the  time  cfitical  subchannel. 


c.  Evaluation 


The  analyses  presented  in  the  previous  sections  have  been  used  to  evaluate  t^r 
repuired  dedicated  caoacitv  (in  percents  of  total  available  caDacity)  as  a  function  of  the 
packet  arrival  rate  of  the  time  critical  traffic.  Two  wideband  satellite  channel  capeeit'e': 
have  been  chosen  as  reference  examples;  =  14  Mbps  and  C  =  20  Mbps.  The  number  of 
accessing  ES's,  X%  assumes  two  extreme  values  5  end  20.  The  capacity  dedicated  to  the 
time  critical  subchannel  follows  directly  from  the  correspondinsr  repetition  time  T  of 
the  dedicated  slots  of  length  L  =  1  Kbit.  Thus,  for  the  slotted  ALPHA  case, 


For  the  dedicated  TD^I  case, 
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L*  ?.I 
T*C 


(7) 


TC 


C 


The  numerical  results  are  plotted  in  Figures  4  and  5.  As  it  can  be  seen,  slotted  ALOHA 
saturates  much  faster  than  the  dedicated  TDM.  This  is  due  to  the  fact  that  the  slotted 
ALOHA  scheme  schould  be  designed  such  that  the  probability  of  contention  is  less  than  5% 
to  account  for  the  500  msec  delay  induced  with  any  collision.  The  dedicated  TDM  can  allow 
contention  of  arriving  packets  at  the  transmitting  queue.  If,  for  example,  the  dedicated  slot 
is  repeated  every  10  msec,  a  packet  may  remain  on  queue  for  five  periods  without  violating 
the  50  msec  delay  constraint.  Therefore,  the  increase  of  the  dedicated  caoacity 
fequivalentlv  the  decrease  of  the  period  T)  has  a  greater  impact  on  the  dedicated  TDM  than 
on  the  slotted  ALOHA  scheme.  In  addition,  S-ALOHA  schemes  involve  more 
imolementation  complexity  and  require  flow  control  for  stability.  V'e  conclude,  therefore, 
that  dedicated  TDM  is  preferred  to  S-ALOHA  for  handling  the  time  critical  traffic  on  a 
dedicated  subchannel. 

Another  important  advantas-e  of  usinp’  a  dedicated  subchannel  aoproach  is  that  these 
schemes  decouple  the  time  critical  from  the  bullc  traffic,  thus  permitting  a  dynamic 
assignment  scheme  for  the  letter.  Their  main  disadvantages  are  due  to  the  protocol 
complexity  of  handling  seoarate  log-'cal  links  and  the  overhead  associated  with  the  frame 
fragmentation. 


ni.  PRIORITY  TDMA 


A.  Motivation  and  Descriotion 


This  section  deals  vnth  the  imolicetinrs  of  incorporatiPT  the  time  critical  lA  traffic 
within  the  bulk  traffic  access.  We  concluded,  in  the  previous  section,  that  slots  of  lensrth  L, 
dedicated  to  every  transmitting  source  and  dispersed  in  fixed  time  intervals,  provide  an 
acceptable  mechanism  for  handling  the  time  critical  packets.  This  motivated  the  design  of 
t^>e  simple  and  robust  priority  TDMA  scheme  presented  in  tt'is  section.  Instead  of  the  single 
server  model  of  Section  D,  B,  we  simplv  collocate  all  slots  dedicated  to  one  source  for  a 
frame  period  less  tf’an  or  equal  to  50  msec.  Py  doing  so,  we  reduce  the  frame  fraementation 
overhead  and  allow  the  non-time  critical  traffic  to  occupy  anv  idle  capacity  not  used  by  the 
time  critical  packets  (recall  that  the  OS";  confidence  constraint  results  in  large  percentages 
of  unused  capacity).  We  briefly  describe  its  mein  features,  (see  Figv're  f>): 

All  traffic  accesses  the  channel  via  a  T  sec  frame,  T  ^  .50  msec.  The  frame 
length  should  be  long  enoutrh  to  result  in  acceptable  fragmentation  without 
violating  the  50  msec  latency  constraint.  Note,  that  the  high  speed  satellite 
channel  is  the  main  reason  for  making  this  scheme  attractive.  As  an  example,  c 
50  msec  frame  carries  1  Mbits  when  usintr  a  20  '!hps  channel. 

A  variable  sixe  slot  is  allocated  to  everv  ES.  The  minimum  allocation  ner  FS 
should  he  such  that  05*^  of  the  arriving  time  critical  packets  can  be 
accommodated  within  50  msec.  The  ES's  serve  the  time  critical  packets  with 
non-pree motive  priority  over  the  other  traffic  classes. 
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The  slot  allocation  remains  fixed  for  a  period  of  time.  Every  ES  has  full 
responsibility  to  allocate  its  data  according  to  given  priority  tables  and  using 
some  scheduling  algorithm.  Thus,  the  scheme  can  be  classified  as  fixed 
uplink/shared  downlink  multiple  access.  It  is,  however,  possible  to  adapt  the 
capacity  assignment  to  the  traffic  variations  usin?  a  centralized  control 
mechanism  as  in  [7].  The  central  controlle**  ES  can  update  the  allocations 
either  by  monitorinc  the  ES's  traffic  statistics  or  after  considering  periodic 
traffic  request  reports.  Note,  that  the  synchronization  and  allocation  update 
processes  mav  span  many  frames.^  In  addition,  the  complexity  involved  in 
updating  the  slot  allocation  suggests  that  this  should  not  be  performed  very 
frequentlv  Itvpicallv  ever'’  1,000  or  2,000  frames).  The  scheme  can,  therefore, 
adapt  to  relatively  slov;  variations  of  the  traffic  mix:  this  is  the  case  of  a  large 
poDulation  of  terminals  connected  to  an  ES.  resulting  in  quite  smooth  traffic 
with  no  rapid  peaks  due  to  averaging  effects. 

In  the  following  section,  v  e  present  the  analytic  tools  needed  to  evaluate  the  r^inimum 
allocation  per  ES  in  order  to  guarantee  proper  time  critical  service. 


B.  Delay  Analysis 

The  95°<r.  confidence  interval  delev  analysis  is  based  on  the  M/P/N  oueue  model  of 
TDM  A  scheme.  Similar  multiserver  queueing  m.odels  hpve  been  reported  in  [®] ,  [  ’h] ,  and 
[11  ).  The  M/p/v.  queue  is  an  accurate  model  for  the  time  critical  packet-servier 
mechanism  of  Figure  £.  Specifically,  the  service  time  of  a  packet  is  defined  as  the  frame 
duration  T,  since  all  packets  are  considered  for  service  at  the  beginning  of  a  new  frame. 


The  number  of  servers  Nj  corresponds  to  the  number  of  time  critical  packets  fitting  in  slot  i. 
Whenever  the  allocation  Q.  is  greater  than  the  minimum  Np  the  delay  constraint  is  further 
relaxed.  We  focus,  therefore,  on  the  computation  of  the  minimum  allocation  Np  For 
simplicity,  we  will  drop  all  subscripts  i,  denoting  ESp  In  order  to  simplify  our  analysis,  we 
assume  that  the  frame  length  T  is  decicted  such  that  the  95%  delay  =  50  msec 

corresponds  to  an  integer  number  of  frames: 

A  ^*95 

^  =  7^.=-^,  K  =  l,2,... 

The  delay  D  encountered  by  a  packet  will  depend  on  the  number  of  oackets  q  found  in  queue: 
specificallv,  we  have. 


(8) 


D  =  Latency  if  q  <  N 

D  =  Latency  +  if  N  <  q  <  2N 

D  =  Latency  2T^  if  <  q  <  3N 

I 

« 

P  =  Latencv  +  iT,.  if  K  <  o  <  (i  + 

h  -  - 


Furthermore,  since  the  frame  latency  equals  to  one  frame  with  lOPys  confidence,  we  have, 

>0 

cj  <iN 

ft  immediately  follows  that  with  P.g,^  =  K  •  Tj,, 

P'*{P<D.g5  }  =  Pr{q<  KN  }  (9) 

0 


0 
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We  wish  to  evaluate  Xj,,  the  maximur'  rate  of  the  tirr,e  critical  traffic  for  which  the  r.h.s  of 
(9)  is  kept  above  95%; 


D 


With 


K 


K  =  1,  2,. 


Find,  Xj^  =  max  {X}  (i0) 

•  Such  that,  Pr{r'<D.pj}  =  Pr{q<KK}  ^  95% 

The  queue  state  statistics,  q,  needed  to  evaluate  (10)  are  summarized  in  the  Appendix.  For 
^  K  =  1>  Pr  {q  <  N}  is  given  explicitlv  by  (A-3)  in  the  Appendix.  Thus,  the  maximum  rate  X  ^ 

can  be  easily  computed  for  various  values  of  N  using  a  simple  search  algorithm.  For  K  >  1, 
no  exact  formula  for  the  probability  Pr{  q  <  KN)  is  available.  Instead,  we  use  an 
^  approximate  method  based  on  the  first  and  second  moments  of  the  queue  state  as  evaluated 
in  the  Appendix.  Our  aporoximation  consists  of  the  assumption  that  the  WD/N  oueue  size 
distribution  with  arrival  rate  Xp  and  frame  length  Ty,  as  defined  in  (10\  will  have  for  a 
^  given  N,  the  same  shape  if  normalized  with  respect  to  K.  Under  the  above  assum.ption.  for 
every  N  we  define  the  sp''ead  factor  FFfM,  such  that  the  95%  oueue  size  KN  of  (10)  is, 


KN  =  rj^.{q)  +  SF(N)  (11> 

Here,  Ej^  (q)  and  o^  are  the  mean  and  standard  deviation  of  the  queue  size  under  and 
•>  given  by  (A-4)  and  (A-5)  in  the  Aooendix,  The  assumption  that  SPfN)  is  independent  of 

K  permits  its  analytical  evaluation  from  the  known  case  K  =  1.  The  SF(N)  estimate  can  then 
provide  confidence  intervals  for  larger  values  of  K.  Other  choices  for  the  spread  factor. 
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such  as  SF  =  4.5  (Chebyshev's  inequality),  SF  =  3,  SF  =  1.96  ^normal  approximations),  mey 
prove  valid  for  specific  parameter  sets,  but  are  inflexible  in  adjusting  to  every  particular 
queue  distribution.  Our  method,  which  provides  flexible  and  tieht  estimates  has  been 
checked  via  discrete  simulation  (see  the  next  sectionl. 

The  corresponding  algorithm  is  summarized  below: 

Step  1;  For  a  given  and  N,  set  K  =  1,  =  D.gg. 

Find  Xi  =  max  {x}  ,  such  that  Pr{  q<  N}  >  95%  (Eq.  A-3) 

V.’ith  X  =  X^  and  T  =  compute  Ej  (q)  (Eq.  A-4)  and  ^  (Eq.  A-5). 

N  -  E.  (q) 

Compute  SF(N)  =  - 

_ 

For  K  =  2,-3,...,  and 

Find  X,.  =  max  {X}  ,  such  that  KK  <  E,.  ^o)  +  SF(N)*  o 
where,  E^.  (q)  and  are  given  from  Eos.  (A-4)  and  ^A-5)  with  X  =  Xp. 

T  =  T 

K 

C.  Numerical  Results  and  Evaluation 

The  analysis  of  the  previous  section  has  been  carried  out  with  D  =  50  msec  and  I'  = 

5/  0 

1,  2,  3,  4  correspond  in?  to  frame  durations  =  50  msec,  =  25  msec,  T.^  =  16.66  msec, 
and  =  12.5  m;sec.  The  minimum  number  v  of  capacity  "quanta"  (a  quantum  can 
accommodate  one  time  critical  packet  length  j,)  is  varied  from  1  to  45.  The  corresponi^in^ 
tolerable  time  critical  packet  rates  X  1^  =  1?  2,  3,  4,  have  been  computed.  A  fairlv  simple 


Steo  2: 


Step  3: 


time  driven  simulption  propram  has  been  used  to  simulate  the  queue  recursive 

equation  (see  Appendix).  Experiments  have  been  conducted  to  evaluate  the  probability  Pj^, 
of  violating  the  50  msec  delay.  The  simulation  was  run  over  10,000  frames  and  statistics 
have  been  taken  after  the  first  1,000  transient  frames. 

In  Table  2,  '.ve  tabulate  the  analytically  obtained  \  ^  for  N  =  1,...,45  and  the 
corresponding  simulation  result  P^..  In  the  same  table,  ve  show  the  various  SF  (spread 
factor)  choices.  'Fhese  '•esults  are  plotted  in  Fifmres  7  and  8.  Specifically,  in  F'gime  7,  ve 
show  the  maximium  tolerable  rate,  versus  the  total  number  of  capacitv  quanta  dedicated 
to  a  particular  ES  during  1  sec  (N  slots  in  a  =  75  msec  scheme  co'^resoond  to  2N  slots  in 
the  T^  =  50  msec  case).  The  quite  strikins  observation  from  Figure  7  is  the  considerahie 
improvement  obtained  under  the  Tp  =  ?5  msec  frame  over  the  T^  =  50  misec  one.  Their 
imnrovement,  ranging  from  50%  to  70%,  can  be  understood  by  observing  that  the  one  frame 
queueing,  tolerated  under  the  T^  =  25  msec  frame  schem.e,  has  an  averaging  effect  on  the 
bursty  Poisson  arriving  traffic.  It  is  remarkable  to  see  that  further  partitioning  of  the 
frame  (by  a  factor  of  3  and  4)  does  not  lead  to  any  substantial  imp''ovement;  this  is  due  to 
the  fact  that  the  P5%  confidence  constraint  limits  the  utilization  to  lew  levels  end,  hence, 
the  probability  of  large  queue  sizes,  requiring  more  than  two  frames  queueing,  is  very  small. 

In  Figure  8,  we  plot  the  simulation  derived  orobabilities  of  violating  the  50  msec 
constraint  for  the  (N,  pairs  computed  using  the  analytic  technique  described  above. 
These  probabilities  fall  below  the  5%  specification,  thus  indicate  that  our  analysis 
assumptions  are  valid  and,  even  more,  rather  conservative.  Finally,  the  histogram.s  of  tl  r 
queue  size  distribution  are  plotted  in  Figure  P  fo''  sample  values  of  N  and  the  extreme  casc' 
K  =  1  and  K  =  4.  'h^ese  histograms,  obtained  from  the  10,000  frame  simulations,  show  that 
the  queue  size  distributions  have  similar  shapes  for  a  given  N  but  differ  for  different  N 
values.  Hence,  our  algorithm,  which  evaluates  for  every  choice  of  N  a  corresponding  spread 
factor  SF(N),  is  based  on  a  reasonable  assumption. 
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Note,  that  the  results  plotted  in  Figure  7  are  independent  of  the  time  critical  packet 
size  (equal  to  the  capacity  quantum  size).  Fifure  7  can  be  used  as  a  v.orkinp  too]  in  order 
to  evaluate  the  minimum  allocation  per  FS  for  frame  sizes  t  =  SO  m.sec  and  T  <  25  msec.  It 
can  be  easilv  seen  from  the  fig'ure  that  frame  lengths  shorter  than  25  msec  does  not  provide 
substantial  imorovement.  V'e  can,  therefore,  use  the  25  msec  frame  to  approximately 
evaluate  the  minimum  capacity  for  anv  value  T  <  25  msec.  As  an  example,  an  ES  with  time 
critical  packet  arrival  rate  A  =  150  packets/sec  and  frame  lenpth  T  =  20  msec,  needs  under 
the  25  msec  curve  of  Figure  7,  200  quanta/sec  or  4  quanta  per  20  msec  frame.  This 
technique  relaxes  the  simplifying  assumption  (8)  of  integer  number  of  frames  to  the  D  = 
50  msec  delay  specification.  In  case  that  no  other  constraints  are  imposed  (e.c., 
synchronization,  etc.),  the  frame  lensith  should  be  set  eoual  to  D  ,,,  12-  25  msec  for  near- 
optimal  utilization  and  reduced  frame  fragmentation. 


IV.  CONCLUSIONS 


The  various  satellite  access  alternatives  for  the  time  critical  traffic  in  a  general 
purpose  satellite  packet  network  have  been  analyzed  and  evaluated.  The  stringent  end-to- 

0  end  delay  soecification  has  irroosed  a  50  msec  access  delav  with  95%  confidence.  Two 

general  catesrories  have  been  considered.  The  dedicated  subchannel,  first  category,  consists 
of  access  schemes  which  decouple  the  time  critical  traffic  service  from  the  total  bulk 

•  traffic  access  mechanism.  The  dedicated  subchannel  is  used  for  the  direct  access  of 
outstandin?  time  critical  packets.  The  inevitably  introduced  contention  should  be  kept  such 
that  the  delay  constraint  is  not  violated.  The  analysis  has  clearly  shovm  that  slotted 

««  ALOHA  random  access  reduces  substanti’iUy  the  s\’stem.  effici'^ncy  over  the  dedicated  Tl'"'' 

subchannel  strateev.  Furth.ermore,  the  TDH  scheme  is  superior  in  terms  of  its  stability  and 
robustness  and  the  inherent  capability  to  provide  a  control  and  reservation  subchannel  for 

•  the  bulk  traffic,  "^he  bulk  Jow  prio'’ity  traffic  can  be  served  via  any  demand  assignment 
scheme  involving  a.  traffic  adantiPET  control  mechanism,  c.f.,  explicit  reservations.  Ti;c 
mechanism  of  decoupling  the  time  critical  component  of  tfe  traffic  is,  howe^'er,  associntc'' 

•  with  considerable  protocol  comnleyity  and  involves  various  fragmentation  overheads. 

The  second  catesrorv  consists  of  the  Priority  TDMA  scheme  which  provides  a  unifo'’') 
method  for  ell  traffic  categories.  The  accessing  FS's  ere  responsible  to  schedule  tbe  pendir^ 
traffic  within  their  transmitting  bursts  according  to  priority  rules.  A  centralized  cont"ol 
nriechanism  partitions  the  transponder  capacitv  into  dedicated  variable  size  t'me  sjets. 
comprising  a  frame,  such  that  of  the  outstanding  time  critical  traffic  wHl  be  giver' 

*#  access  within  50  msec.  An  "V/n/N  queueing  analysis  has  been  used  to  evaluate  the  mini'T'i'^ 
capacity  assignments  which  guarantee  the  validity  of  the  delay  constraint  over  a  ranee  cl 
anticipated  time  critical  traffic  rates.  A  frame  length  of  ?5  msec  has  been  found  to  provide 

■0 

0 
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pear  optimal  capacity  utilization.  Amonc  the  obvious  advantages  of  this  scheme,  we 
mention  its  stability  and  robustness,  sinfi[le  logical  link  protocol  simplicity,  reduced 
frapmentation  overhead  and  flexibility  to  accommodate  on  oriority  the  ES  traffic  mix.  Its 
disadvantage  lay  on  its  rigiditv  in  real  time  adapting  to  fast  traffic  variations  among  ES's. 

As  a  general  conclusion,  we  feel  that  the  priority  TDi'dA  scheme  provides  a  sim.ple  and 
attractive  access  candidate,  especially  in  the  case  of  ES's  collecting  data  f’*om  a  large  and 
relatively  fixed  terminal  population.  The  bursty  character  of  a  terminal  traffic  is  then 
averaged  over  the  entire  poouletion  and  does  not  dramatically  affect  the  transpon'^er 
partition  requirements.  Non  real-time  bandwidth  adjustments  are,  of  course,  possible  via  e 
centralized  reallocation  orotocol. 


19 


APPENDIX;  tfi/D/V  QUEUE  gTATISTICS 


Let  T  be  the  frnme  duration  end  xthe  time  criticpl  pecMet  arrival  rate  in  packets /sec. 
The  queue  state  v.’ill  he  the  number  of  packets  in  Queue,  just  after  the  opening  of  the  i 
frame.  With  Vj»  nev  arrivals  generated  during  the  frame  and,  at  most  N  packets  served 
per  frame,  as  in  Figure  A.l,  the  queue  difference  equation  will  be: 


+V| 


where. 


I 


0  if  a  <0 

a  if  a  >  0 

and  is  Poisson  distributed  with  mean  p=  <  N.  From  the  ^V^/N  Queue  analvsis  in 
[  12  ],  the  stations rv  penpratin?  function  of  the  Queue  state  probabilities  is  given  by. 


C(7.)  = 


-  pW?  -  1 1 


1  -2  F.XP  [pO  -  z)] 


N  - 

n 

r  =  1 


7  - 


1  _ 


(A-n 


where,  are  the  ^V-1)  roots.  Iz^,!  1  of  the  equation. 


?/■  rvr  [p(l  -7)]  =  1 


fA-"^ 


The  root  of  hA-1 )  is  the  obvious  =  1.  A  simple  numerical  alporithm  is  giver,  in  [1?] 
for  the  calculation  of  these  roots.  V'ith, 


A.l 


•4 


X  =  Re  [z_],  Y„  =  Im  Iz  ] 

r  r  ’  r  r 


it  easilv  turns  out  that  'V’  ,  Y  satisfy  the  nonlinear  svstem, 
■  r  r  • 


r  2r  H  +  p  Y  “I 
Xj,  =  EXP  lP/X(Vj.-l)]  COS  - - - 

r  ?r  n  +  p  y  1 

^  =  EXP  [p/N(X^-l)]  SIX  I - W—^\ 


From  Eq.  (A-l )  ve  pet,  after  some  alfehra,  the  fcUowinp  queue  size  statistics, 


Pr  {a  <  K  }  =  ^  P(q  =  i)  =  ^ - 

i  =  0  H  (1  -  z  ) 


(A-:’.) 


Note,  that  this  is  the  probability  that  an  arrival  encounters  at  most,  N  -  1  packets  in  Queue 
and,  hence,  pets  access  im  media  telv  into  the  next  frame.  The  mean  and  variance  of  c  ere 
eiven  hv, 


E(o)  =  I 


o  O 
V  _  V  •  +  D  ‘ 


1-z.  ?(N  -  P) 


0  ^  =  C?”(i)  +  O'fl)  -[  O’O)  ] 


(A-F) 


0’(1 }  =  ^ 


1  +  0‘  -  N  -  ?DK 


1-z. 


2^N.'  -  p) 
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TABLE  1:  D  FOP  VAPIOUS  SF  CHOICFS 


X  =  1  packet/sec  T  =  50  msec* 


SF  =  4.5  (Chebyshev) 
SF  =  3.0 

SF  =  1.9fi  (Normal) 


^•95  ~  msec 
D.gS  =  90.0  msec 
D.gj  =  77.9  msec 


SF  =  1.5 


■95 


50.2  msec 


TABLE  2:  MAVIVUM  TIME  CRITirAL  PACKET  RATE 


F»*am€ 


50  msec 

25  msec 

Pl% 

^2 

P2% 

1 

4.72 

6 

1.20 

6 

3.70 

33 

1.88 

16 

4.66 

73 

3.20 

26 

3.94 

no 

2.74 

38 

4.32 

150 

2.98 

51 

4.66 

190 

2.96 

64 

4.54 

225 

2.12 

78 

4.56 

265 

2.90 

92 

4.34 

305 

3.12 

106 

4.26 

346 

3.20 

181 

4.46 

546 

3.78 

261 

4.54 

746 

4.23 

341 

4.73 

945 

3.87 

426 

4.89 

— 

— 

511 

4.70 

— 

— 

596 

4.58 

— 

— 

686 

5.04 

— 

— 

Maximum  Rate  (Packets /See) 
Pj,{q/K  >N  }  Simulation 


16.66  msec 


12.5  msec 


ACCESS  METHODS 


FIGURE  1;  ACCESS  ALTERNATIVES 


FRAME  DURATION  (T  <  50  MSEC) 
SLOT  LENGTH  (KBITS) 


FIGURE  2:  S-ALOHA  SUBCHANNEL 
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ABSTRACT 


Coaxial  cable  based,  broadcast  Local  Area  Networks 
(LANs)  are  widely  recognized  as  an  efiective  means  to 
provide  both  communication  capabilities  to  a  variety  of 
office  automation  equipment  as  well  as  interoperability 
among  data  processing  configurations.  This  paper 
presents  the  results  of  a  performartce  analysis  of  a  typical 
environment  to  which  both  of  these  factors  contribute. 
The  environment  includes  a  variety  of  line  speeds  and 
protocols  and  access  is  made  to  remote  data  bases  and 
hosts. 

The  network  carries  messages,  packetized  to  the 
optimal  packet  length  and  transmit!^  via  the  CSMA/CD 
protocol  (Carrier  Sensing  Multiple  Access/Collision 
Detection).  End-to-end  delay -and  access  delay  of  the 
CSMA/CD  protocol,  is  studied.  It  is  shown  that  lor  a 
realistic  traffic  mix,  the  principal  contribution  to  end-to- 
end  delay  arises  from  the  queueing  and  NIU  processing. 

Sensitivity  of  the  performance  of  the  LAN  to 
variations  in  the  number  of  users  and  the  traffic  volume  is 
investigated.  This  analysis  yields  the  result  that  end-to- 
end  performance  is  not  sensitive  to  tignitieMnt  increases 
(up  to  a  factor  of  6)  in  the  number  of  users  and  the 
associated  traffic  volume  above  the  relatively  high 
volumes  used  in  the  reference  scenario. 


1.  INTRODUCTION 

Recent  developments  in  intra-building  or  intra- 
campus  communications  indicate  that  local  networking  is 
taking  a  path  distinct  from  traditional  long  haul 
communications  and  computer-to-peripheral  data  bases. 
The  unique  environment  of  local  networks  in  terms  of 
geographical  dispersion  (from  100  meters  to  1C  Km), 
transmission  speed  (between  9  Kbps  to  30  Mbps), 
unregulated  environment  and  support  of  dissimilar  devices 
and  protocols  is  summarized  in  [l]  .  Interoperability 
a.T>ong  various  user  requirements  is  the  key  factor  to  the 
development  of  local  networks,  often  imposing  the  maior 
cost  and  performance  constraints  instead  of  bandwidth, 
which  was  the  typical  scarce  commodity  in  a  more 
conventional  communications  environment. 

From  the  early  RAD  efforts  in  Local  Area  Networks 
(LAN),  it  became  apparent  that  less  controlled  random 
access  methods  could  be  very  efficient  for  high-speed 
broadcast  media.  This  led  to  the  Carrier  Sensing  Multiple 
Access  protocol  with  Collision  Detection  (CSMA/CD)  as 
first  introduced  in  [2j .  In  parallel,  experiments  showed 
that  off-the-shelf  cable  TV  technology  represents  an 
excellent  vehicle  for  implemeriting  broadcast  LAN  [3]  . 


More  recently,  the  data  communication  and  office 
automation  users  community  has  shown  great  interest  in 
local  networking.  This  interest  was  strongly  influenced  by 
extensive  vendor  activity,  new  hardware/scftware 
announcements,  the  IEEE  standards  committee  for  LAN, 
and  the  intensive  advertising  campaign  in  the  mass  media. 
The  technical  and  scientific  literature  is  flooded  by 
articles  with  the  main  controversial  issues  being  the 
transmission  method  (baseband  versus  broadband/RF)  [a]  , 
[^]  and  network  access  scheme  [6]  ,  [7]  .  Most  of  the 
analytical  work  is  dealing  with  performance  comparisons 
[>] )  [9]  o(  various  proposed  channel  partition 

methods,  in  particular  the  two  that  have  been  blessed  as 
IEEE  standards  by  the  LAN  committee.  Carrier  Sense 
Multiple  Access  with  Collision  Detection  (CSMA/CD)  and 
the  Token  Ring  [10] . 

The  97%  channel  utilization  capability  of 
ETHERNET  reported  by  Shoch  and  Hupp  [11]  ,  completely 
ignores  the  processing  time  at  the  Network  Interface  Unit 
(NIU),  the  interaction  between  the  CSMA/CD  access  and 
higher  level  protocols,  buffer  management  at  NIU,  and 
the  nodes  s^d  mismatch.  Watson  [12]  performed 
simulations  of  two  CSMA/CD  type  networks  with 
differen'  methods  of  buffer  management  and  speed 
mismatch  and  concludes  that  network  performance  is 
stongly  dependent  on  these  factors,  sometimes  reducing 
utilization  to  less  than  30%.  In  [13]  ,  we  presented  a 
detailed  modeling  of  two  microprocessor  architectures  for 
local  network  interface  adapters  and  analyzed  their 
impact  on  throughput/delay  performance  on  end-to-end 
(ETE)  character  transfer.  Our  studies  showed  that  the 
main  limitation  to  LAN  performance  is  due  to  the 
interface  processing  and  qeueing,  while  its  sensitivity  to 
the  coaxi^  cable  speed  and  access  protocol  is  rather 
limited. 

In  this  paper,  we  do  not  attempt  to  model  closely 
the  NIU  architecture  at  the  microprocessor  level.  Having 
established  in  [13]  that  the  bottleneck  exists  on  the 
protocol  processing  and  the  Packet  Assembly/Disassembly 
(PAD),  we  assess  at  the  macroscopic  level  the  ETE  delays 
lor  a  multiple  applications  scenario,  typical  of  an  office 
automation  environment.  The  NIU  is  modeled  as  a  single 
server  queue,  where  messages  are  input  from  a  Data 
Termi.-iiil  Equipment  (DTE),  (e.g  ,  a  TTY,  a  minicomputer 
port,  a  word  processor,  a  digital  FAX  machine), 
packetized  appropriately  and  transmitted  to  another  NIU 
via  the  t'.SMA/CD  protocol.  Both  packet  and  message 
delays  ere  analyzed  among  various  types  of  DTE's.  Before 
proceeding  in  describing  the  mathematical  models  and 
presenting  the  experiments,  we  establish  in  a  rigorous 
manner  the  delay  definitions: 
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Packet  Delay;  From  the  time  the  first  sit  of 
a  packet  is  ready  to  be  sent 
at  the  source  MU  to  the  time 
the  last  bit  is  received  at  the 
destination  NIU. 

Message  Delay;  From  the  time  the  first  bit  of 
the  message  is  ready  to  be 
sent  over  the  source  DTE 
access  channel,  to  the  time 
the  last  bit  of  the  message  is 
received  at  the  destination 


(A  message  is  typically  chopped  into  several  packets 
which  may  be  interleaved  as  they  travel  through  the 
network). 


The  aggregate  class  i  packet  arrival  rate  summed 
over  all  DTE's  transmitting  class  i  packets,  is  denoted  by 
Xi-  It  follows  that  the  total  cumulative  packet  rate  S, 
normalized  to  T|  is 

S«Ti  r  XI 


S  is  related  to  the  total  offered  packet  traffic  C, 
including  retransmissions  due  to  collisions  and  busy 
conditions,  as 

Ce-*C 

5  s - 

(X  ♦  a)  Ce-*C  ♦  (1  ♦  aC)  (I  -e  *«C)2  ♦  I 


In  what  follows,  we  first  introduce  the  models  for 
the  packet  and  the  message  delay  analysis.  We  then 
describe  the  data-base  of  the  office  building  scenario  and 
subsequently  present  results  and  seruitivity  analysis. 

2.  MATHEMATICAL  MODELtNC 


Packet  Delay  Analysis 


where  X  is  the  packet  length,  weighted  by  the  class  mix 
and  normalized  to  Tj: 

£  XI  Ti 
I 

X  «  - 

Tlf  XJ 
1 


The  packet  delay  will  consist  of  the  transmission 
time  the  packet  spends  in  the  coaxial  cable  and  the  waste 
and  back-off  delays  due  to  busy  condition  or  collisions. 
We  follow  the  approximate  CSMA/CD  analysis  presented 
in  [14]  ,  adapted  to  reflect  various  classes  of  packet 
lengths  depending  on  the  application  type. 

Let  T|  be  the  transmission  time  of  packet  class  i, 
(including  all  link  level  overhead)  and  y  be  the  propogation 
delay  (in  nsec/ft).  Then  the  rtormaJi zed  propagation  delay 
relative  to  packet  size  of  class  I  is 

r.L 

a  * - 

Tl 

where  L  is  the  maximum  cable  length.  This  assumption  is 
a  pessimistic  orte,  since  it  implies  that  all  packet 
transmissions  will  encounter  the  maximum  possible 
propagation  delay. 


SMrvsL  or 
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FKiUHE  1:  ILLUSTRATION  OF  THE  AVERAGE  WASTED 
INTERVAL  DUE  TO  COLLISION  W 


The  packet  delay  per  class  i  will  be 
Dj  =  (A|  ♦  A2  ♦  Aj)  Tj 

when  A I  is  the  waste  due  to  collisions,  A2  the  dead  time 
due  to  the  back  off  algorithm  in  case  of  collision  or 
carrier  busy  and  as  the  propagation  and  transmission  time 
(all  normalized  to  T  | ).  It  can  be  shown  that 

A I  c  N2  (N  ♦  a) 


Aj  *  a  ♦ 


Ti 

Tl 


(jNjvl. 


Ri 

!)♦  (N|  -N2) - 

Tl 


where 


•e-aC 


\ 

(the  waste  due  to  collisions. 
Figure  1) 

(the  average  iHimber  a  packet 
has  to  back-off) 


N2  *  (1  *  •C)  e**G  -  I  (average  number  a  packet 
collides) 


R|,  R2  are  the  mean  retransmission  intervals  in 
case  of  busy  condition,  or  collision  in  which  case  the 
binary  back-off  algorithm  is  applied  as  in  [2] . 


Mess-tte  Delay  Analysis 


This  delay  will  consist  of  the  serialization  delay  at 
the  DTE  port,  the  protocol  execution  time  at  the  source 
and  destination  NIU,  the  CSMA/CD  packet  delays  as 
computed  above  and  queueirtg  delays  at  the  source  (  a 
packet  waits  to  be  transmitted)  and  destination  (packets 
wait  to  be  reassembled  into  a  message  to  be  transmitted 
over  the  serial  DTE  port). 
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Several  types  of  DTE's  will  be  considered,  each 
transmitting  or  receiving  various  classes  of  messages. 
Let 


Di  (j.  k) 

The  ETE  delay  of  message  class  i 
from  a  DTE  of  type  i  to  a  DTE  of 
typek 

Tn 

Protocol  execution  time 

Di 

Packet  delay  due  to  CSMA/CD 

TRi  (j) 

Class  i  packet  trar.smission  time 
and  type  j  DTE 

Mi 

Average  rwmber  of  packets  per 
message  of  class  i 

Qi'N(j) 

First  packet  queueing  of  class  1 
messages  on  access  queue  of  DTE 
type  j  port 

QiOUT(j) 

Packet  queueing  of  class  I,  at 
destination  queue  of  DTE  type  j 

port 


In  case  the  source  DTE  is  a  low  speed  DTE  or  host 
connected  via  multiple  parallel  ports,  then  Qil^fi)  =  0, 
since  no  queueing  is  practically  taking  place  at  these 
access  ports.  For  DTE's  connected  via  a  high  speed  port, 
the  NIU  may  have  to  buffer  irtcoming  messages  to  avoid 
overflow.  To  compute  Qi**^(i),  we  note  that  a  message 
will  be  input  to  the  NIU  as  a  whole.  Thus,  the  M/M/1 
formula  on  a  message  basis  can  be  used  (assuming  Poisson 
message  arrivals  and  exponented  message  length)  to 
compute  the  message  queueing  reflected  at  the  first 
packet. 

At  the  destintion  DTE  packets  belonging  to  a 
message,  will  not  follow  one  another  at  regular  intervals 
and  may  be  interleaved  with  other  message  packets.  Ve 
assume  that  received  packets  are  Poisson  distributed  and 
have  exponential  length.  Thus  the  M/M/ 1  formula  on  a 
packet  basis  can  be  used  to  provide  Qi^^T(i). 

To  compute  the  ETE  message  delay  we  distinguish 
between  two  cases. 

Case  I  -  The  delay  bottleneck  occurs  at  the 
destination  NIU.  This  will  happen  if  the  source  channel 
speed  is  higher  than  the  destination  channel  speed.  In  this 
case,  the  message  delay  will  be  dominated  by  the  packet 
queueing  and  transmission  time  at  the  destination  DTE 
port,  Mj  rQi®'-*T(k)  ♦  TRjfk)] .  Other  components  to  the 
ETE  delay  include  message  protocol  processing  27/^,  the 
CSMA/CD  delay  Dj  and  the  first  packet  queueing  and 
transmission  over  the  source  port,  as  illustrated  in  Figure 
2a.  It  follows  that 

Dj  (i,  k)»  QjlNfi)  ♦  TOi  (j)  ♦  2Tn  ♦  Di  ♦ 

♦  Mj  tQi°^^(k)  ♦  TRj  (k)] 


A.  aouacc  eoat  secco  >  destinatiom  eoat  speed 


SOimCE  PORT 

NETWORK 
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destination 


a.  SOURCE  PORT  SPEED  <  OESTMATIOH  PORT  SPEED 


FIGURE  2:  ILLUSTRATIQII  OF  ETE  MESSAGE  DELAY 


REQUIREMENT  ASSUMPTIONS 


The  models  above  have  been  used  to  evaluate 
delay/throughput  characteristics  for  a  typical  office 
building  LAN.  The  maximum  length  of  the  single  coaxial 
cable  was  assumed  $,000  feet  'enough  to  span  a  20  story 
average  office  building).  The  LAN  was  assumed  to 
support  1,000  terminals,  10  minicomputers,  25  word 
processing  centers  (each  including  several  CRTs  and 
floppy  disk  driversi,  and  20  digital  FAX  units. 
Communications  with  the  outside  world  were  supported 
via  a  high-speed  gateway  (l.5<i«  Mbps)  ^o  some  long-haul 
T I  -  rate  line. 

In  Tables  I,  2,  and  3  we  summarize  the  device 
characteristics  and  the  traffic  volumes  they  feed  to  the 
network,  the  message  types  and  tl>e  routing  of  messages 
both  internally  and  to  the  gateway.  Note  that  traffic 
numbers  are  purposely  exaggerated.  For  exarnpie  a  CRT 
terminal  is  assumed  to  generate  120  messages/hour,  each 
message  approximately  consisting  of  *0  characters.  This 
corresponds  to  an  operator  being  able  to  type  SO 
characters/mm.  continuously  without  waiting  for  a 
response!  Similarly  the  number  of  devices  in  the  building 
exceeds  current  office  automation  needs. 


Case  2  •  The  delav  bottleneck  is  the  source  NIU. 
This  occurs  if  the  source  channel  speed  is  lower  than  the 
destination  chanrtel  speed.  Here  the  message  residence 
(ETE  delay)  will  be  dominated  by  the  transmission  time  at 
the  source  port  Mj  TRj  (j)  as  in  Figure  2b.  Thus 

Dj  (),  k)  *QjlN(j)  ♦  Mj  TRj  (j)  ♦  2Tn  *  Dj  ♦ 

♦  (QjOUTfW  ♦  TR.fl,)] 


k.  NUMERICAL  RESULTS 

The  packet  and  message  delay  analysis  were 
implemented  in  a  computer  package  called  PLAN 
(Performance  of  Loral  Area  Networks).  This  program 
accepts  as  input  a  Data  Rase  on  requirements  in  a  format 
similar  to  Tables  I,  2  and  3  and  outputs  packet  and 
message  ETE  delays  per  device-type  pair. 
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TABLE  I;  DEVICE  CHARACTERISTICS 


Device  Type 

Access 

Number 

(Kbps) 

Message 

Speed 

(Msg/hour) 

Rate 

1. 

Gateway  (CW) 

1 

i,s«a.o 

• 

2. 

Word  Processing 

23 

9.8 

120 

3. 

Asynch.  Terminals 

iOO 

1.2 

120 

4. 

BSC  Terminals 

soo 

9.6 

120 

3. 

Mini  Computers 

10 

(••) 

(••) 

6. 

Low  Speed  FAX 

10 

1.2 

3 

7. 

High  Speed  FAX 

10 

9.8 

30 

(•)  The  gateway  traffic  is  evaluated  from  the  message 
rooting  information  (Table  3). 


(**)  Two  configurations  are  ootttidered; 

(1)  Multiple  Parallel  ports  per  mini 

-  File  trarofer  ports  at  Kbps, 

generating  3  messages  (files)/hour. 

Asynch.  ports  at  1.2  Kbps.  The  message 
volume  to/from  these  ports  is  computed 
from  the  routing  irtformation. 

-  BSC  ports  at  9.6  Kbps.  The  message 
volume  to/from  these  ports  is  computed 
from  the  routing  information. 

(2)  Single  Multiplexed  port  at  128  Kbps. 


TABLE  V.  ROUTING  ASSUMPTIONS 


From 

To 

WP 

TERMINAL 

FAX 

MINI  (file  XFER) 
MINI  (other) 

CW 

GW 

CW,  WP  (30%,  30%) 

CW,  MINI,  TERMINAL  (73%,  12.3%, 
12.3%) 

CW(|00%) 

CW,  MINI  (70%,  30%) 

TERMINAL  (symmetric  to  other 
direction) 

FAX  (3  times  the  other  direction) 
MINI,  TERMINAL,  WP  (symmetric) 

Two  classes  of  experiments  were  performed  using 
PLAN.  The  first  class  included  studies  on  effects  of 
r«etwork  load,  cable  speed  and  packet  length  to  the  packet 
access  (CSMA/CD)  delays.  The  second  class  consisted  of 
experiments  on  ETE  message  delays  and  their  sensitivity 
to  various  factors. 

A.  Experiments  on  Packet  Delay 

In  Figure  3  we  plotted  the  average  packet  delay  for 
the  data>base  of  section  3  (hereafter  referred  to  as 
reference  scenario)  under  various  packet  length  choices. 
Very  small  packet  sizes  (less  than  2  Kbits)  result  into 
considerable  message  fragmentation  and  consei^uently 
large  protocol  overhead.  On  the  other  hand,  larger  packet 
sizes  result  into  unfairness  (a  single  packet  will  take  over 
the  cable  for  a  prolonged  time),  need  for  large  buffer 
sizes  and  incompatibilities  with  the  device  access  method. 
Thus,  the  optimal  choice  for  a  packet  length  as  indicated 
in  Figure  3  is  t  Kbiu  above  which  no  significant 
performance  improvement  is  achieved.  Note  that  if  a 
measage  is  less  than  a  Kbits  it  will  form  a  single  packet 
for  transmission. 


TABLE  2;  NETWORK/TRAFFIC  PARAMETERS 


Packet  Header 
Cable  Capacity 
Cable  Length 
Retransmission 
Protocol  Processing 


238  bits 
I  •  10  Mbps 
3000  feet 
10  Packets 
•0  sec/bit  (•) 


Message  Class 

Length  (Information  Kbits) 

Terminal  Input 
Asynch  Output 
BSC  Output 

Word  Processing 
File 

FAX  (page) 

0.«8 

2.00  (•*) 
1.00  (•*) 
18.00 

200.00 

1,000.00 

*  As  assessed  in  . 

*  *  The  output  to  the  terminal  will  be  much  larger  than 

what  the  operator  types  in. 


nr-URE  3:  EFrCCT  CF  PACKET  LENGTH  (INFO  BITS) 
ON  THE  AVERAGE  PACKET  DELAY 
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In  Figure  the  packet  delay  is  given  as  a  function 
of  the  aggregate  load,  varying  up  to  3  Kbps  (the  cable 
capacity).  Our  reference  scenario,  generates 
approximately  300  Kbps.  Thus  the  cable  is  utilized  only 
at  10%  and  the  margin  for  growth,  insensitive  to  packet 
delays,  is  up  to  63%  utilization  (2  Mbps).  This 
demortstrates  that  the  access  method  is  not  a  significant 
factor  to  delays  under  a  wide  range  of  overload  scenarios. 
Note  that  our  reference  scertario  is  by  far  exaggerating 
traffic  volumes  in  a  typical  office  building. 


RCTaULi 

FIGURE  4:  EFFECT  Or  BOS  LOAD  ON  PACKET  DELAY 
B.  Experiments  on  Message  Delays 

In  Table  6  we  depict  the  ETE  message  delays  under 
the  parallel-port  and  single-multiplexed  minicomputer 
attachment  (as  expected,  the  single  high-speed  port  at 
128  Kbps  performs  better  than  parallel  ports  although  the 
basis  of  comparison  is  rather  weak).  Note  that  delay  can 
be  rather  large  (of  the  order  of  minutes)  due  to  the  device 
access  speed  and  queueing.  This  is  further  demonstrated 
in  Figure  3,  whereby  queueing  at  the  access  port  of  Word 
Processing  centers  (WP)  results  into  considerable  delay 
degradation  as  message  rate  increases.  Again  for  our 
reference  scenario,  and  its  vicinity,  delays  are  rather 
insensitive.  In  Figure  6,  we  show  the  dramatic  effect  of 
the  single-multiplexed  port  speed  on  minicomputer  traffic 
as  it  results  from  the  reference  scenario. 

Subsequent  experiments  showed  that  there  is  no 
significant  sensitivity  on  the  number  of  devices,  assuming 
that  the  aggregate  load  remains  constant. 

5.  CONCLUSIONS 

in  summary  we  conclude  that  the  CSMA/CO  access 
delay  is  of  the  order  of  miliseconds,  orders  of  magnitude 
under  the  message  ETE  delays.  These  are  due  mainly  to 
limited  device  access  speeds  and  queueing  at  the  serial 
port  at  the  OTE-NIU  interface.  Furthermore,  it  was 
established  that  coaxiiA?  cable  LAN's  can  support 
extensive  office  automation  requirements  up  to  six  times 
a  reference  scenario,  already  overloaded  without  any 
significant  performance  degradation. 


Note  that  the  LAN  technology  can  offer  a 
multiplicity  of  the  3  -  10  Mbps  capacity  either  via 
internetting  among  various  coaxial  cable  structures  or  by 
exploring  the  virtually  unlimited  capability  of  RF 
frequency  division  multiplexing  in  the  VHF/UHF  CATV 
bands. 


TABLE  «:  END-TO-END  MESSAGE  DELAY  (SEC) 
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FIGURE  5:  EFFECT  OF  MESSAGE  RATE  ON  END-TO- 
END  MESSAGE  DELAY:  WORD  PROCESSING 
AND  BSC-ASYN  TERMINALS 
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FIGURE  6:  EFFECT  OF  PORT  SPEED  ON  MINI-COMPUTER 
MESSAGE  ETE  DELAY:  SI NGLE  PORT 
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ABSTRACT 

This  paper  deals  with  analysis  and  design  issues 
encountered  in  the  development  and  evaluation  of 
microprocessor-based  interface  adapters  to  broadcast 
carrier  sensing  multiple  access  (CSMA)  local  computer 
networks.  Queueing  network  models  are  presented  and 
approximate  analytic  techniques  are  used  to  obtain 
throughput/delay  measures  associated  with  the 
interface  processors. 

In  the  first  part  of  the  paper,  the  protocol 
partition  issue  is  studied  as  it  is  implemented  via  two 
Stage  architectures.  It  is  shown  that  off-loading  link 
level  functions  to  the  terminal  end  CPUs,  improves 
dramatically  the  system  throughput  versus  partitions 
between  separate  intelligent  boards  at  the  terminal  and 
the  network  ends  of  the  interface  unit.  In  the  second 
part,  a  closed  queueing  network  model  is  used  to  study 
the  interface  throughput  under  window  flow  control.  It 
is  shown  that  the  main  limitation  is  due  to  protocol 
processing,  while  performance  is  unaffected  by 
irameters  referring  to  the  network  overall  utilization 
*  speed  within  a  wide  region  of  operation. 

INTRODUCTION 

Local  area  networks  (LANs)  are  rapidly  emerging 
as  a  major  and  distirKt  class  of 
computer  commurtications  networks.  Their  salient 
characteristics  make  them  particularly  attrar'^ive  as  an 
efficient  and  economic  solution  to  high  speed, 
unregulated  connections  within  a  limited  distance 
geographical  area  ^l] .  From  the  early  R&O  efforts,  it 
became  apparent  that  random  access  protocols  could  be 
more  efficient  for  high  speed  broadcast  media  over 
conventional  contention  resolution  techniques  used  in 
long  haul  networks.  This  led  to  carrier  sense  multiple 
access  protocols  (CSMA),  such  as  the  Ethernet  access 
scheme  [2] ,  employing  collision  detection  and  the 
hyperchannel  access  method  [3j ,  a  hybrid  between 
CSMA  and  polling.  Another  class  of  access  protocols 
use  token  passing  techniques,  a  variation  of  hub 
polling  W. 
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Various  design  issues  have  been  raised  by  LAN 
vendors  and  users.  Mainly  concerning  access  protocols, 
network  topology,  and  transmission  technology.  On 
access  protocols,  extensive  study  have  demonstrated 
relative  merits  of  the  two  prevailing  techniques,  CSMA 
with  collision  detection  and  token  passing,  e.g.  [5j  , 
[ft]  ,  and  [?]  .  Among  topological  designs,  bus  (tree) 
networks  are  preferred  for  CSMA  protocols  and  rings 
for  token  passing.  Finally,  a  major  "dichatomy"  on 
transmission  technology  exists  between  modulated  RF 
broadband  cable  transmission  (employing 
CATV  components)  and  baseband  tapping  on  a  passive 
coaxial  cable.  All  the  above  issues  could  be  classified 
as  referring  to  the  lower  protocol  functions_which  refer 
to  the  physical  layer  in  the  ISO  model  [8]  .  As  a 
consequence,  the  IEEE  LAN  Standards  Committee  [9] 
came  up  with  a  draft  proposal  which  includes  all  the 
major  physical  layer  options  (e.g.  bus,  CSMA/CD,  ring  - 
token  polling,  bus  -  t^en  polling)  and  builds  link  and 
network  level  standards  independent  of  the  physical 
layer.  This  recognizes  the  fact  that  other  protocol 
issues  may  become  more  important  than  the  access 
method  and  trinsmission  techtwiogy,  especially  in  view 
of  the  ample  bandwidth  (high  speed)  available  in  LANs. 

Immediately  related  to  the  protocol  function  in 
LANs  is  the  architecture  of  the  network  interface  unit 
(NIU),  within  which  most  of  the  the  lower  layers  are 
implemented.  These  multiple  microprocessor  units 
have  to  interlace  a  variety  of  data  terminal  equipment 
(DTE),  including  high  speed  terminals,  multi-vendor 
hosts,  etc.  They  differ  from  conventional 
communication  processors,  especially  on  the  very  high 
output  speed  (e.g.  10  .Mbps)  to  be  achieved  at  a  cost 
justifying  their  existence.  For  example,  on  a  per  p;.". 
basis  an  NIU  supporting  asynchronous  terminals  (OTEs) 
should  not  exceed  the  cost  of  a  short  haul  modem. 

It  was  long  suspected  that  throughput  analysis 
ignoring  processing  times  at  the  NIU  and  interaction 
between  higher  level  protocols  failed  to  identify 
potential  bottlenecks,  more  severe  than  the  limits  of 
the  access  schemes  and  the  medium  speed.  As  an 
example,  the  97%  Ethernet  utilization  reported  in  [10] 
ignores  the  limitations  of  the  NIU  architecture.  On  the 
other  hand,  simulation  results  on  Ethernet  type 
networks  with  different  buffer  management  and  speed 
mismatch  [l  ij  ,  showed  that  network  throughput  is 
very  sensitive  to  those  factors,  sometimes  reducing 
utilization  to  less  than  50%.  In  [l?]  we  presented  a 
detailed  queueing  network  model  of  two  microprocessor 
architectures  for  interfacinr  clusters  of  terminals  to  a 
coaxial  cable  based  CSMA/CD  network.  Using  queueing 
network  modeling  and  similar  ar,alytical  methods  used 
in  performance  evaluation  of  computer  systems,  we 
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studied  the  elfeci  of  the  NIIJ  on  end-to-end  character 
transfer.  Our  studies  showed  that  the  mam  limitation 
to  UAN  performance  is  due  to  the  interface  processing 
and  queueing,  while  its  sensitivity  to  the  cable  speed 
and  access  protocol  is  rather  limited.  In  |.l  3]  we 
further  demonstrated  via  approximate  analytic  tools, 
that  for  realistic  office  automation  scenario,  the  delay 
bottleneck  exists  at  the  OTE/network  interlace. 

These  studies,  apart  from  identifying  a  serious 
performance  limitation,  helped  us  to  acquire  a  valuable 
analysis/design  tool  to  study  NIU  architectures.  Their 
variations  in  the  design  variables  predict  weak  links  and 
often  system  parameters  which  are  important  to  the 
evaluation  of  network  performance.  In  this  paper,  we 
first  summarize  the  trade-off  between  the  two 
architectures  studied  in  [l  ^  i*  shown  that  clearly 
one  class  of  NIU  designs  is  superior  to  the  other.  An 
improved  architecture  (within  the  best  class)  is  then 
outlined  and  an  analytic  tool  is  presented  to  model  and 
evaluate  the  performance  of  such  designs.  Finally, 
sample  numerical  results  are  provided,  along  with 
sensitivity  studies  on  NIU  throughput  and  delays  or 
functions  of  critical  processing  times  and  protocol 
parameters. 

2.  TRADE-OFF  STUDIES  ON  TWO  NIU 

ARCHITECTURES 

In  most  implementations,  NIU's  consist  of  two 
basic  units:  (I)  the  terminal  interface  unit  (TIU)  and  (2) 
the  transceiver  unit  (TRU).  The  former  serves  the  user 
end  of  the  interface,  while  the  latter  the  network  end. 
in  our  model  we  assumed  that  each  NIU  includes  up  to 
four  TIU  boards:  every  TIU  can  service  two  to  eight 
FOX  ports  at  spMds  up  to  9.6  Kbps,  as  s.hown  in  Fig.  I. 
Each  TIU  has  a  dedicated  8  bit  NMOS  microprocessor 
with  a  I  microsec  minimum  instruction  cycle,  like  8085, 
6809,  or  Z80.  The  TIUs  and  the  TRU  are 
interconnected  via  a  parallel  master  bus,  having  a  speed 
equal  to  the  coaxial  cable  transmission  speed  (<•  or  10 
Mbps.) 

The  two  architectures  referred  to  as  architecture 
A  and  architecture  B,  differ  at  the  TRU  block: 

(a)  Hardware 

Type  A  TRU  has  a  CPU  and  memory  on 
board,  which  controls  the  DMA  transfer 
to/from  the  TIUs  and  an  HDLC  LSI  chip 
controller,  serviced  by  the  CPU. 

TRU  has  no  CPU  or  memory  on 
board.  The  DMA  transfer  to/lrom  TIUs  are 
controlled  by  the  TIUs  and  an  arbitration 
logic.  The  frame  functions  are  performed 
by  MSI  hardware  (such  as  CRC  and  flag 
generations,  earner  sensing,  etc.) 

(b)  Protocol  Level  Partition 

In  NIU  Type  A,  all  physical,  link,  and  partly 
network  level  function>  are  pei  formed  by 
the  TRU  with  the  TIU  performing  only 
packet  assernbly/disassembl)  (PAD) 
routines.  In  NIU  type  13,  tre  TRL'  is  merviy 
performing  physical  level  lurtctions  (except 
the  backoff  timer)  and  no  link  functioris. 
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2.1  NIU  Modeling 

In  order  to  analyze  the  performance  of  the  two 
architectures,  we  modeled  each  CPU  board  (the  TIUs 
and  the  TRU  for  NIU-A)  as  a  central  server  network 
with  priorities  and  feedback  passes  through  the  central 
queue.  The  internal  parallel  bus  of  a  CPU  boa'd.  whi-h 
is  exclusively  required  at  every  processing  pass,  is  the 
"commodity”  for  which  requests  are  queued-up.  As  an 
example,  consider  the  TIU  queueing  model  for  NIU-A  as 
give  in  Fig  .2.  Packets  from  the  terminals  enter  the 


BIU  at  rate  Ain  after  experiencing  the  serialization 
delay  Din  hold  the  parallel  bus  (or  the  duration  of  the 
input  interrupt  processing.  A  second  pass  through  the 
bus  models  the  PAD  processing  functions  and  then 
packets  reside  in  RAM  waiting  to  be  fetched  by  the 
DMA  controller.  If  the  DMA  transfer  is  done  on  cycle 
stealing,  it  will  be  transparent  to  the  bus  functioning 
apart  from  slowing  down  background  processing. 
Otherwise,  the  DMA  transfer  is  modeled  as  a  third  pass 
with  pre-emptive  priority  over  background  processing. 
Similar  passes  will  occur  on  packets  routed  from  the 
TRU  to  one  of  the  TIU  serial  ports.  For  a  detailed 
specification  of  the  packet  (low  in  the  various  stages 
and  the  processing  times  of  each  pair  through  the 
central  server  the  reader  is  referred  to  [l  l]  . 

To  integrate  two  various  stages  into  the  NIU 
model,  two  approaches  are  taken  depending  on  the 
architecture: 
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For  NIU  A  -  ihe  queueing  networks  represeittinq 
TIUs  and  TRII  are  interconnected  via  a  polling 
mechanism  ivhich  simulates  the  DMA  controller, 
as  in  Fig.  3-A. 

For  NIU  B  -  the  TRU  is  modeled  as  a  FCFS  queue 
with  service  rate  equal  to  the  coaxial  cable  speed. 
The  server  will  be  disabled  (switch  in  ol(  position. 
Fig.  )-B)  if  the  carrier  on  the  network  is  sensed 
busy.  The  BIUs/TRU  coupling  via  arbitration  is 
also  modeled  as  a  FCFS  mechanism. 


2.2  Results  and  CorKlusions 


The  models  were  used  to  run  experiments  and 
perform  sensitivity  studies.  When  the  two 
architectures  are  lightly  leaded,  they  provide  similar 
delay  performarKC.  As  traffic  increases  NIU>A 
saturates  at  the  TRU  CPU  for  link  level  processing. 
For  NIU*B,  saturation  occurs  due  to  contention  among 
TIUs  to  access  the  TRU  pipeline. 

For  the  set  of  parameters  used  and  for  packets 
including  only  I  character  of  information  (10  bits),  the 
analysis  showed  (Fig.  a)  that  NIU'B  saturated  at 
terminal  traffic  of  ),000  characters/min  whereas  NIU- 
A  saturated  at  ISO  characters/m  in  only.  Clearly.  NIU- 
A  suffers  from  the  separation  of  link  protocol  functions 
between  T1U>TRU  in  series  and  the  single  TRU  is  the 
bottleneck  of  the  system.  NIU-B  achieves  the  highest 
possible  degree  of  parallel  processing  among  TIUs.  with 
the  TRU  acting  at  a  mere  transceiver.  Thus  results  the 
IS-fold  improvement  over  NIU-A.  The  tingle  character 
per  packet  requirement  was  imposed  from  real  ectio 
FOX  specifications  and  rendered  throughput  r-aiculation 
quite  pessimistic.  The  results,  however,  are  very  useful 
(or  comparative  purposes. 
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FIGURE  4-  COMPARATIVE  NIU  PERFORMANCE 


Another  interesting  study  concerns  the  sensitivity 
on  NIU  throughput  and  delay  from  variation  ol  ilie 
network  (coaxial  cable)  utilization.  As  seen  in  Fic.  A, 
the  NIU  performance  is  almost  insensitive  to  network 
utilizations  from  10  up  to  A0%.  Higher  utilizations 
result  into  excessive  carrier  busy  and  collision 
conditions  and  the  performance  of  the  system  degrades. 

In  conclusion,  it  appears  that  the  main  limitation 
on  LAN  performance  is  due  to  interface  processing  and 
queueing,  while  its  sensitivity  to  the  cable  speed  and 
access  protocol  is  rather  limited.  Furthermore, 
performance  is  greatly  affected  by  the  protoco! 
partition  among  TIU  and  TRU  with  bottlenecks  arising 
if  the  three  lower  layers  (physical,  link  and  netw'O'k) 
are  allocated  in  both  ends.  A  pipeline  TRU  requiie-. 
customized  hardware  and  lacks  the  flexibility  m 
adapting  to  different  access  protocols  through 
software,  but  exhibits  impressive  improvement  on  the 
NIU  throughput.  In  the  next  section,  wc  show  how  to 
approach  the  modeling  and  analysis  of  a  more 
K'phistiCBied  version  of  NIU>B,  not  limited  to  smyl'* 
character  packets  and  incorporating  wi.ndnw  How 
control  to  keep  memory  sizes  within  reasortable  limits. 
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J.  ANALYSIS  OF  AN  IMPROVED  NIU 

architecture 

The  trade-off  studies  reported  above 
demonstrated  the  merit  of  offloading  the  TKU  from  all 
link  level  functions,  except  those  performed  by 
hardware  at  the  speed  of  the  network  (CRC  checking, 
(lag  generation  and  incoming  packet  address  filtering). 
The  models,  however,  were  of  limited  practical  use. 
since  they  assumed  single  information  character  per 
packet  and  did  not  model  flow  control  techniques  at  the 
network  level,  in  order  to  alleviate  buffer  overflow  and 
speed  mismatching.  In  what  follows,  we  will  highlight 
an  improved  NIU  architecture  which  implements 
variable  packet  sizes  and  end-to-ertd  flow  control. 
Then  we  present  a  queueing  network  model  and  an 
analytic  technique  to  obtain  throughput/delay 
measures.  The  analysis  is  applied  to  study  NIU 
maximum  throughput  and  its  sensitivity  to  various 
design  parameters. 

).l  Architecture  and  Simplifyine  Assumptions 

We  consider  an  NIU  interfacing  data  terminals  to 
a  broadcast  CSMA  cable.  As  before,  the  NIU  consists 
of  a  number  of  parallel  TIU  CPU  boards  (up  to  four)  and 
a  TRU  with  no  CPU  or  memory  on  board.  The  DMA 
transfer  between  TIUs  and  TRU  is  performed  via 
arbitration  on  a  master  bus  at  speed  equal  to  the 
coaxial  cable  speed  (typically  10  Mbps).  Cycle  stealing 
is  assumed  although  burst  transfers  could  be  modeled  as 
well. 

Each  TIU  provides  a  number  of  serial  ports  for 
terminal  interfacing.  Typically,  the  number  of 
terminals  supported  per  NIU  varies  from  eight  ii.S  Kbps 
terminals  to  four  9.6  Kbps  and  one  56  Kbps  high  speed 
OTE.  As  in  Fig.  5,  a  task  queue  controls  the  CPU 
processing  from  the  I/O  interrupt  to  packet 
assembly/disassembly,  address  formation  and  control 
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field  setting.  The  completed  data  packet,  with  its  "TIU 
header".  Fig.  6,  will  then  reside  in  RAM  awaiting  lor 
DMA  transfer.  The  TIU  processing  time  will  be  given 
approximately  by  the  linear  formula. 

TIU  Processing  :  p  *  q*  N 

Where  p  is  a  fixed  processing  time  per  packet,  N 
is  the  number  of  information  byte  per  packet  and  q  the 
interrupt  processing  time  per  bit.  Typically,  q  =  0.01 
msec/bit  and  p  :  10-70  msec  per  input  data  packet  or 
p  :  5  -  55  msec  for  ACKs  and  output  packets.  Here, 
input  refers  to  the  DTE-to-network  direction  and 
output  to  the  network-to-DTE. 


Terminal  data  are  packelized  upon  occurrance  of 
either  of  the  following  events: 

(1)  carriage  return  of  other  control  character. 

(2)  buffer  space  full  (typically  2048  bits) 

())  no  new  character  arrived  from  a  DTE  within 
a  certain  window 
(typica)ly  10  to  50  msec.) 

In  the  case  of  terminal-to-computer  interactive 
mode,  which  is  the  basis  of  our  model,  the  information 
field  of  host  response  packets  is  usually  ten  times 
longer  than  a  terminal  query.  In  our  analysis,  we 
assume  that  terminal  packets  are  exponentiallv 
distributed  with  mean  10-100  bits  (I -10  characters)  and 
response  packets  are  exponential  with  mean  100-1000 
information  bits.  These  numbers  are  consistent  with 
measurements  on  9.6  Kbps  terminals  performing 
interacting  computing. 

The  DMA  cycle  stealing  mechanism  is  assumed  to 
slow  down  the  CPU  activity  by  a  factor  of  50%  during 
actual  data  transfer.  We  ignore  D.MA  controller  turn¬ 
around  and  initialization  times  by  assuming  that  each 
DMA  transfer  direction  is  associated  with  a  separate 
DMA  channel.  Hence  reinitialization  is  done  during 
transmission  on  the  opposite  direction.  It  may  be 
desirable  to  have  direct  memory  access  between 
adiacent  TIU's  and  implement  memory  shared  parallel 
processing.  This  option  is  not  incorporated  in  our 
present  performance  analysis. 

Date  packets  (including  the  TIU  40  bits  overhead. 
Fig.  6),  are  processed  at  the  TRU,  where 
synchronization  bits,  (lags  and  CRC  bits  are  generated 
at  the  coaxial  cable  speed  (l-iO  Mbps).  The  TRU 
overhead  (48  bits)  is  illustrated  in  Fig.  6.  The  TRU  is 
responsible  (or  sensing  the  carrier,  deferring 
transmission  upon  busy  condition  and  retry  after  a 
random  retransmission  period.  In  our  analysis,  we 
assumed  that  the  random  interval  has  mean  5  msec.  No 
collision  detection  was  modeled  although  it  could  be 
simply  incorporated  in  the  model. 
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Collided  or  otherwise  destroyed  packets  would  be 
detected  vie  the  ecknowledRemeni  rnechanum,  of  the 
link  protocol,  in  Fig.  7  a  logical  link  is  established 
through  the  network  between  interacting  OTEs.  On 
that  link,  reliable  exchange  is  guaranteed  by  an 
acknowledgement  (ACK)  mechanism.  Each  query 
packet  (or  group  ol  packets)  is  answered  by  a  special 
ACK  packet  (or  negative  ACK  m  case  of  error  or 
collision)  or  an  ACK  bit  piggybacked  on  information 
packets  flowing  in  the  opposite  direction.  The  window 
is  the  maximum  number  of  outstanding  packets 
awaiting  ACK's.  In  this  paper,  we  considered  only  the 
case  ol  window  size  one.  In  other  words,  the  protocol  is 
a  stop>and-wait  scheme,  whereby  no  packet  is 
transmitted  until  the  previous  one  is  correctly 
acknowledged.  Negative  acknowledgements  would 
result  in  retransmissions  of  the  same  packet.  We 
assumed  that  the  delay  to  receive  a  negative  ACK, 
automatically  introduces  the  random  deferment  needed 
in  random  access  schemes.  Retransmitted  packets 
follow  the  same  route  from  the  TIU  RAM  to  the  TRU 
transfer  via  DMA.  The  assumption  of  a  single  packet 
window  limits  the  maximum  throughput  of  the  system. 
It  is  however  a  simple  technique,  especially  suitable  for 
interactive  applications  where  the  network  induced 
delay  is  orders  of  magnitude  smaller  than  user  think 
time  and  query  processing  at  the  host  end.  An 
extension  to  larger  window  sizes  can  be  easily  derived 
using  the  same  modeling  methodology. 


3.2  Queueing  Network  Model 

The  NIU  architecture  is  modeled  as  a  closed 
queueing  network  using  the  methodology  outlined  in 
^4]  .  The  finite  population  of  "jobs"  travelling  within 
the  network,  equals  the  number  of  terminals  connected 
to  the  NIU.  These  jobs  will  be  either  packets  under 
processing  at  the  TIU  CPU,  packets  waiting  to  be 
transmitted,  ACKs  travelling  back  to  the  NIU,  host 
originated  responses  to  be  transmitted  to  the  terminals, 
or  ‘'permits"  at  the  user  terminals,  whereby  a  response 
has  been  received  and  the  terminal  is  either  in  the 
process  of  serializing  a  new  packet  or  simply  in  a 
"thinking”  state.  Each  user  has  a  corresponding  "job"  in 
one  of  the  above  states. 

In  Fig.  8,  the  queueing  network  is  illustrated  for 
two  TIU's  per  NIU,  each  interfacing  two  terminals.  The 
terminals  (small  circles  to  the  left),  have  a  pure  delay 
associated  with  them,  representing  the  lime  needed  to 
input  a  new  packet,  when  no  other  packet  Irom  this 
user  waits  at  the  TIU  buffer.  This  delay  will  be  zero  m 


case  of  heavy  users  who  do  not  wait  for  a  response  to 
generate  a  new  query.  In  some  interactive 
environments,  users  do  not  fill  the  interface  buffer 
until  they  see  the  echo  from  the  previous  packet.  In 
such  case,  the  "idle”  time  irtcludes  serialization  of  input 
and  output  characters,  at,  user  think  time.  Due  to  this 
excessive  delay,  piggybacking  ACK's  is  impossible  and 
stand>alone  ACK's  are  used.  in  o>jr  analysis  we 
assumed  heavy  terminal  users  and  mostly  piggybacked 
ACK's  in  order  to  study  maximum  thoughput  scenario. 
Obviously,  maximum  throughput  is  achieved  when  there 
is  always  at  least  one  packet  ready  to  be  transmitted 
upon  receipt  of  a  response  and  the  pure  delay  at  the 
terminals  is  zero. 

The  CPU  processes  a  mix  of  packets  consisting  of 
ACK's,  input  and  output  packets.  It's  service  time  is 
slowed  down  due  to  the  DMA  cycle  stealing. 

The  control  queue,  serves  all  "ready"  packets. 
Due  to  the  FCFS  nature  of  the  DMA  arbitration 
mechanism,  we  include  all  TIU  RAM's  within  a  single 
buffer,  although  this  is  not  physically  the  case.  The 
same  mix  of  packets  are  served  as  in  the  TIU  CPU,  with 
rate  equal  to  the  DMA  transfer  (or  the  coaxial  cable 
speed).  However,  due  to  the  CSMA  protocol,  the  server 
is  modeled  with  an  ON/OFF  switch,  representing  the 
unavailability  of  service  when  the  carrier  is  sensed 
busy.  The  switch  will  increase  the  effective  service 
rate  by  introducing  retrial  intervals  weighted  by  the 
probability  of  busy  condition  (see  Appendix  2).  Ue 
assume  that  the  reserving  interval  is  random  with  a 
mean  of  >  msec. 

Upon  completion  of  service  at  the  DMA  queue, 
host  responses  and  ACK's  are  sent  back  to  the  CPU.  As 
explained  above,  a  user  data  packet  will  generate  either 
a  rtegative  ACK  on  collision  or  a  positive  ACK 
(stand-alone  or  piggybacked  within  host  response 
packets).  ACK's  will  be  generated  at  the  destination 
end  after  a  delay  equal  approximately  to  C  ^  ^-')'> 
msec.  Host  response  packets  will  be  either 
immediately  transmitted  (if  they  already  wait  in  RAM) 
or  need  to  be  generated  alter  some  random  "idle*  time. 
In  Fig.  S,  ACK  generation  and  host  response  times  are 
modeled  as  pure  (non-queueing)  delay  elements.  For 
maximum  throughput  studies,  we  assumed  that  all 
positive  ACK's  are  piggybacked  on  host  packet  ready  to 
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be  transmitted.  Thus  in  heavy  traffic  scenario,  Tq  =  Ty 
5  0,  or  users  and  hosts  nenerjie  i>ai  kcts  witliin  the  lime 
delay  of  the  flow  control  window. 

An  exact  analysis  of  the  closed  queueing  network 
of  Fig.  8,  cannot  be  obtained  due  to  the 
non-exponential  mix  of  queues  I  and  2.  Reasonable 
results  may  be  obtained  via  the  FCFS  approximations  in 
mean  value  analysis  found  m  [l .  The  network  is  a 
multichain  model,  where  each  Tl'J  board  corresponds  to 
one  chain  with  chain  population  equal  to  the  number  of 
terminals  per  TIU.  In  Appendix  I,  we  provide  further 
details  on  the  analytic  technique  as  implemented  for 
the  single  chain  (one  TIU/TRU)  simplified  version 
reported  in  ).3. 

3.3  Numerical  Examples 

The  modeling  and  analytic  technique  presented  in 
3.2  was  implemented  for  a  single  TIU  per  NIU 
configuration  as  in  Fig.  9.  Several  parallel  TIU's  per 
NIU  can  definitely  be  studied  as  well,  but  since  most  of 
the  existing  LAN  interfaces  do  not  support  such 
parallelism  for  cost  considerations,  we  preferred  to 
limit  this  study  to  presently  relevant  architectures. 
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A  highlight  of  the  algorithms  used  in  our  analysis 
is  given  in  the  two  Appendices.  Our  design  parameters 
were  set  initially  to  the  values  reported  in  3.2;  more 
specifically  we  assumed: 

Number  of  Terminals  per  NIU:  U  :  S 
Cable  Speed:  C  :  10  Mbps 
Input  Data  Packet:  X,n  ^  100  bits 
Output  Data  Packet:  :  1000  bits 

TIU  Overhead:  40  bits 
TRU  Overhead:  44  bits 

Processing  Time  per  Input  Packet:  p  :  10  msec 
Processing  Time  per  ACK/Output  Packet  x  p/2 
Interrupt  Time  per  bit  x  0.01  msec 

Carrier  Resensing  Delay:  )  msec 
DMA  Cycle  Stealing  Fraction:  30% 

To  assets  the  CSMA  parameters,  we  also  need  the 
network-wide  cable  utilization  s  and  the  cable 
propagation  delay  a.  We  assumed  that  these  were  not 
affected  by  the  individual  NIU  traffic  variations 


(except  changes  on  the  packet  sire  which  are  .issmned 
to  be  universal).  This  is  a  reasonable  assiiinpiiun  tor  a 
broadcast  LAN  with  200  NIU's  connected  to  it,  each 
with  different  throughput.  The  s  and  a  parameters 
were  taken  as: 

s  X  22% 

a  X  1.3)  microsec. 

In  Fig.  10,  we  present  the  NIU  throughput  and 
delay  variations,  as  the  terminal  and  host  "idle"  times 
vary.  Note  that  by  NIU  throughput,  it  is  meant  the 
aggregate  amount  of  information  bits/sec  flowing  In 
both  directions.  By  NIU  delay,  we  mean  the  additional 
time  a  terminal  generated  data  packet  has  to  wait  for 
transmission  due  to  NIU  processing.  It  is  apparent  from 
the  figure,  that  maximum  throughput  values  are 
obtained  with  very  short  idle  times  (packets  always 
wait  in  memory  to  be  transmitted  as  soon  as  the 
window  opens).  The  effect  of  the  TfiJ  processing  time 
p,  IS  dominant  in  heavy  traffic  conditions,  ^s  idle 
times  increase,  the  throughput  is  limited  by  external 
(actors  rather  than  the  NIU  bandwidth.  Similarly 
delays  depend  critically  upon  p. 
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Network  parameiert  were  shown  to  have  a 
minimal  effect  on  NIU  throughput.  Specifically,  with 
cable  speeds  I  Mbps  and  10  Mbps,  maximum  thruui;hput 
values  were  46.46  Kbps  and  47.06  Kbps  respectively,  lor 
the  same  network  utilization.  In  Fig.  II,  we  plotted 
the  maximum  NIU  throughput  versus  network 
utilization.  For  utilizations  less  than  )0%,  NIU 
performance  was  basically  insensitive  to  s.  The 
instability  at  utilizations  higher  than  60%,  is  due  to  the 
CSMA  saturation. 


Firtally,  in  Fig.  12,  we  studied  the  effect  of 
packet  sizes  on  NIU  throughput,  NIU  delay  and 
"round-trip"  delay  (the  time  elapsed  from  closing  the 
window  at  the  terminal  NIU,  till  a  response  on  ACK 
from  the  host  reopens  it).  It  is  important  to  note  that 
although  the  throughput  iiKreases  with  the  packet  size 
(since  more  data  bits  experience  the  fixed  processing 
time  per  packet),  the  curve  approaches  some  limit, 
whereby  the  bit  interrupt  processing  and  cable 
transmission  delay  offset  the  advantage  gained  from 
larger  packet  sizes.  Note  that  the  packet  size  is  mainly 
limited  by  other  considerations,  such  as  buffer  size  and 
packetization  laterKy. 

4.  SUMMARY  OF  CONCLUSIONS 

A  conclusion  consistent  with  experience  of  LAN 
users  and  vendors  is  that  the  network  parameters  do  not 
greatly  affect  throughput/delay  performance,  if  the 
cable  is  utilized  below  saturation  (e.g..  60%).  The 
limiting  factor  in  most  cases  reside  at  the  interface 
unit,  where  bartdwidth  can  be  much  lower  than  the 
network  capacity  to  carry  data. 

In  analyzing  the  internal  NIU  behavior,  we  first 
identified  that  protocol  partition  within  the  NIU  is  an 
important  design  factor.  Specifically,  execution  of  the 
link  protocol  at  the  tranceiver  unit  by  a  centralized 
CPU  for  all  terminal  interface  boards,  appear  highly 
inefficient.  On  the  other  hand,  an  architecture 
whereby  the  terminal  interface  units  control  their 
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associated  links,  with  the  tranceiver  performing  basic 
transmission  functions  in  hardware,  achieves  a  high 
degree  of  parallel  processing  and  alleviates  the  control 
bottleneck. 

The  second  part  of  the  paper,  dealt  with  modeling 
and  analyzing  an  improved  NIU  architecture.  The 
window  flow  control  of  the  end-to-end  link  was 
modeled  via  closed  queueing  network  methods  and 
approximate  analysis  were  performed.  It  was  found 
that  the  NIU  throughput  was  very  sensitive  to  the 
protocol  processing  delays,  where  as  cable  speed  and 
utilization  do  not  have  any  obvious  effect  in  normal 
operating  conditions.  An  important  design  parameter 
(apart  from  the  CPU  speed),  is  the  packet  size  wnh  the 
trade-off  being  between  throughput  and  builer  size. 

The  above  results  were  obtained  (or  a  simple  NIU 
architecture  (one  terminal  interface  unit  per  MU)  a>id  a 
stop-and-wait  (window  I)  protocol.  Immediate 
extensions,  include  more  than  one  parallel  boards  Dcr 
NIU  and  window  sizes  greater  than  1.  Further  areas  of 
research,  include  modeling  the  sources  (terminals)  rrior-.- 
precisely,  assess  quantitatively  buffer  size 
I'equircments  and  improve  the  approximate  analysis  on 
the  FCFS  non-exponential  queues. 
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In  order  to  compute  the  first  and  second  moments 
of  the  service  times  at  queue  I  and  2,  we  define  the 
probabilistic  breakdown  of  the  tralfic  mix  and  weight 
means  and  second  moments  ol  various  components 
appropriately.  As  an  example,  for  queue  I,  we  have 
ACKs  short  (input)  and  long  (output)  data  packets. 

At  queue  one,  the  service  time  must  also 
include  the  DMA  cycle  stealing  effect.  Thus,  if  i$ 
the  service  time  without  DMA  overhead  and  Utru 
the  TRU  utilization  (fraction  of  time  data  are 
transmitted  or  received). 

*  ■  I  -  0.3UTRU 

Here,  the  cycle  stealing  is  assumed  to  comsune 
30%  of  the  CPU  power.  The  utilization  ol  the  TRU  is  a 
(unction  of  the  NIU  throughput.  This,  however,  is  not 
krwwn  a  priority  and  an  iterative  procedure  is  used, 
whereby  an  initial  run  is  performed  assuming  Utru=  0 
and  subsequently  new  iterations  correct  T|  until 
convergence  is  reached.  In  our  numerical  examples,  the 
procedure  converged  within  1-2  iterations. 

At  queue  two  the  service  time  T2  must  iuclude 
the  effect  of  CSMA  carrier  busy  delays.  Let  Pj  be  the 
probability  the  carrier  is  sensed  busy  as  computed  in 
Appendix  2  and  Oj  the  mean  interval  at  which  the  TRU 
retries  to  sense  the  cable.  If  this  interval  is  uniformly 
distributed  and  ri  is  the  service  time  with  Pj  :  0,  we 
have  that 

T2  «  ♦  Nj  Dj 

E(T22)  .  E(T^2)  ♦  J  Dj2  N,2  (i  ♦  Nj) 

where  N, »  « 

•  .  T  number  of 

'  "  retrials 

The  throughput/delay  evaluation  for  the  finite 
population  network  of  Fig.  9  follows  the  single  chain 
mean  value  analysis  for  closed  queueing  networks  as 
reported  by  Reiser  in  [l .  Due  to  the  non-exponential 
servers  rto  product  form  solution  exists  and  the  FCFS 
approximate  technique  of  [l!^  is  used.  The  input 
consists  of  To,  Ti,  E(Ti2),  T2,  £fT2^),  Tj,  Ti,  and  the 
average  number  of  retransmissions  of  collided  packets, 
Nc. 

The  algorithm  proceeds  as  follows: 

1.  set  w  I  I,  A(0)  .  X|  X  A2  =  ttl  -  h2  =  0 

2.  H  X  Ti  *  ri  (ni -XT|)  ♦  0.5  Ai  E(^l 

*2  »r  2  ♦  ^2  <*'2  •AF2)  ♦  0.5  A2 

3.  t.  To  *  Nc  T3  *  (2  «  Nc)  i|  *  2(1  ♦  Nc)  t2 

A(w)  «  w/t 

Ai  X  (2  ♦  Nc)  A(w) 

A2  X  2(1*  Nc)  A(w) 

"1  *  <|Ai 

"2  «  «2A2 

#.  IF  w  X  K  (the  number  of  terminals/NIU),  STOP; 

ELSE,  w  X  w*l 

END, 


We  used  the  pure  CSMA  throughput  formulation 
as  reported  in  [16]  .  It  assumes  that  Poisson  sources 
(NIUs)  connected  to  the  cable  transmit  packets  of 
length  d  sec  at  a  rate  S  packets/sec.  The  packet 
length  is  computed  as  the  average  among  input,  output 
and  ACK  packets.  The  cable  propagation  delay  was 
assumed  a  =  l.)6  microsec.  If  the  total  scheduled  rate 
C  (including  unsuccessful  and  collided  attempts)  is 
Poisson,  the  following  formula  applies: 


C(2a  ♦  d)*e-*C 


Pr (success} 


the  probability  of  collision.  Pc  is  given  by: 
p  I  -  e~aC 

*  C(2a  ♦  d)*e-»^ 

and  the  probability  ol  busy  carrier  Pj  by 


I  -  Pe  -  ■ 
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